(54) Title: TRANSGENIC ANIMALS FOR ANALYSING CYP3A4 CYTOCHROME P450 GENE REGULATION

(57) Abstract: The invention relates to the generation of non-human transgenic animals comprising a reporter construct for
producing a detectable amount of a reporter molecule operably linked to a transcriptional regulatory nucleic acid molecule from the human
CYP3A4 gene located between the initiation of transcription site of the gene and a position located 13,000 nucleotides upstream
from the site. The invention also relates to the use of these animals for determining the effect of a compound, particularly, but not
exclusively, a xenobiotic or steroid, on the regulation of expression of the CYP3A4 gene in a human.

TECHNICAL FIELD OF THE INVENTION

The invention relates to the generation of a transgenic
animal and to the use of the animal for determining the
effect of a compound, particularly, but not exclusively, a
xenobiotic or steroid, on the regulation of expression of
a P450 gene in a human.

BACKGROUND OF THE INVENTION 



Many endogenous and exogenous compounds are observed to
have a therapeutic effect in drug development trials in
vitro. However, the intended therapeutic effect is often
not realised in clinical practice, for example, when
compounds are co-administered, because certain compounds
induce the expression of the CYP3A4 gene. This induction
generates CYP3A4 cytochrome P450 molecules which
metabolise compounds before the intended therapeutic
effect of each compound can be realised. Accordingly,
induction of expression of the CYP3A4 gene interferes with
intended dosage, leading to therapeutic failure or
sub-optimal treatment.



Induction of CYP3A4 gene expression is a significant
problem for drug development because time, resources and
expense are wasted in the development of candidate drugs
for therapy of particular disease conditions which will
ultimately fail or perform sub-optimally in clinical
practice.

It would be advantageous to have an animal model for use
in drug development trials from which, at an early stage
of drug development, one could determine whether a
candidate drug would be likely to achieve an intended
therapeutic effect in a human.

Such an animal model would not be useful unless at least
some of the aspects of the regulation of CYP3A4 gene
expression in the human, especially tissue specific
expression, are reproduced. This is because in the human,
the CYP3A4 gene is expressed in specific tissues,
including liver and small intestine, which many compounds
inevitably come into contact with when administered for
the purpose of therapy. Accordingly, one would be unable
to determine whether the bio-availability of a candidate
drug would be sufficient for achieving an intended
therapeutic effect in clinical practice in a model which
does not reproduce the constitutive and xenobiotic induced
tissue specific expression of the CYP3A4 gene that is
observed in the human.

WO99/61622 and Goodwin et al. 1999 disclose a nucleic acid
molecule located 8 kb upstream from the initiation of
transcription site of the CYP3A4 gene which regulates
transcription of the CYP3A4 gene in response to xenobiotic
compounds. These documents do not disclose elements for
regulating the constitutive and xenobiotic inducible
tissue specific and developmental expression of the CYP3A4
gene observed in a human.

There is a need for an animal model which reproduces at
least some aspects of the expression of the CYP3A4 gene in
a human, for determining whether a compound, for example,
one identified in a drug development trial, would be
likely to induce CYP3A4, and hence cause drug-drug
interactions, or auto-induction of the metabolism of the
drug under study.

DESCRIPTION OF THE INVENTION

The invention seeks to address the above identified need
and in a first aspect provides a non-human mammal
comprising:

(a) a regulatory nucleic acid molecule which is capable
of regulating transcription of the human CYP3A4 gene and
which comprises a nucleotide sequence that is identical to
a sequence of the human CYP3A4 gene located between the
initiation of transcription site of the gene and a
position located at least 13,000 nucleotides upstream from
the site; and

(b) a reporter nucleic acid molecule for producing a
detectable amount of a reporter molecule for indicating
regulation of transcription of the reporter nucleic acid
molecule by the regulatory nucleic acid molecule

wherein the reporter and regulatory nucleic acid molecules
are arranged to permit the regulatory nucleic acid
molecule to regulate transcription of the reporter nucleic
acid molecule.

As described herein, the inventors have found that the
incorporation of a region of the human CYP3A4 gene that is
located between the initiation of transcription site of
the gene and a position 13,000 nucleotides upstream of the
initiation of transcription site into an animal model
provides the animal with sufficient genetic information
for reproducing the constitutive and xenobiotic induced
tissue specific expression of the CYP3A4 gene that is
observed in humans. More specifically, the inventors have
generated animal models which contain a transgene
comprising this region and have observed that these models
provide constitutive and xenobiotic inducible expression
of a transgene in a tissue pattern which reproduces the
tissue specific expression of CYP3A4 which is observed in
a human. Importantly, the level of constitutive
expression is sufficient to allow one to observe the
effect on the regulation of tissue specific transgene
expression of administration of a compound, for example,
a xenobiotic or steroid, to the animal.

Further, the inventors have observed that the animal
models described herein also reproduce aspects of the
constitutive and xenobiotic inducible developmental
expression of the CYP3A4 gene that is observed in humans.

These findings are unanticipated because prior to the
invention, there was no suggestion that the genetic
information required for simulating the constitutive and
xenobiotic induced tissue specific or developmental
expression of the CYP3A4 gene that is observed in a human
would be contained in the region of the human CYP3A4 gene
between the initiation of transcription site of the gene
and a position 13,000 nucleotides upstream of the
initiation of transcription site.

Further, prior to the invention, differences in the
induction profile of the mouse Cyp3a11 and the human
CYP3A4 gene had been observed, and differences had also
been observed in the ligand binding profile of mouse
transcription factors, especially PXR and CAR, and human
PXR and CAR. Accordingly, there was no suggestion that a
non-human animal would have factors sufficient for
interacting with a region of the CYP3A4 gene for
reproducing the constitutive and xenobiotic induced tissue
specific or developmental expression of CYP3A4 observed in
a human.

Further, prior to the invention, mechanisms associated
with transgene integration had been observed, such as gene
silencing and mosaic transgene expression, which limited
the extent to which a transcriptional enhancer element
incorporated into a transgenic model could reproduce
regulation of gene expression observed in a human.
Accordingly, there was no suggestion that a region of the
human CYP3A4 gene would be capable of reproducing the
regulation of expression of the CYP3A4 gene that is
observed in a human. However, as described herein, the
inventors have shown in 2 separate founder lines that the
expression of the transgene reproduces aspects of CYP3A4
gene expression that are observed in humans.

Thus in a second aspect, the invention provides a
non-human mammal comprising:

(a) a regulatory nucleic acid molecule comprising a
nucleotide sequence that is identical to the nucleotide
sequence of the human CYP3A4 gene that extends about
13,000 nucleotides upstream from the initiation of
transcription site of the gene; and

(b) a reporter nucleic acid molecule for producing a
detectable amount of a reporter molecule for indicating
regulation of transcription of the reporter nucleic acid
molecule by the regulatory nucleic acid molecule

wherein the reporter and regulatory nucleic acid molecules
are arranged to permit the regulatory nucleic acid
molecule to regulate transcription of the reporter nucleic
acid molecule.

In one embodiment, the regulatory nucleic acid molecule
comprises the sequence shown in SEQ ID NO: 1.

Further, as described herein, the inventors have generated
transgenic animals which contain a region of the human
CYP3A4 gene between the initiation of transcription site
and a position about 3,200 nucleotides upstream of the
initiation of transcription site and observed that the
transgene is not constitutively expressed or inducible by
xenobiotics in these animals. Accordingly, the inventors
have found that the genetic information required for
reproducing the constitutive and xenobiotic induced tissue
specific and developmental expression of CYP3A4 observed
in a human is contained in the region of the human CYP3A4
gene between the position located about 3,200 nucleotides
upstream of the initiation of transcription site of the
gene and a position 13,000 nucleotides upstream of the
initiation of transcription site.

Thus, in a third aspect, the invention provides a
non-human mammal comprising:

(a) a regulatory nucleic acid molecule comprising a
nucleotide sequence that is identical to the sequence of
the human CYP3A4 gene that extends about 8,000 nucleotides
upstream from a position about 3,000 nucleotides upstream
from the initiation of transcription site of the gene; and

(b) a reporter nucleic acid molecule for producing a
detectable amount of a reporter molecule for indicating
regulation of transcription of the reporter nucleic acid
molecule by the regulatory nucleic acid molecule

wherein the reporter and regulatory nucleic acid molecules
are arranged to permit the regulatory nucleic acid
molecule to regulate transcription of the reporter nucleic
acid molecule.

In one embodiment, the regulatory nucleic acid molecule
comprises the sequence shown in SEQ ID NO: 2.

In a fourth aspect, the invention provides a non-human
mammal comprising:

(a) a regulatory nucleic acid molecule which is capable
of regulating transcription of the human CYP3A4 gene and
which comprises a nucleotide sequence that is identical to
the sequence of the human CYP3A4 gene that extends about
600 nucleotides upstream from a position about 7,200
nucleotides upstream of the initiation of transcription
site of the gene; and

(b) a reporter nucleic acid molecule for producing a
detectable amount of a reporter molecule for indicating
regulation of transcription of the reporter nucleic acid
molecule by the regulatory nucleic acid molecule

wherein the reporter and regulatory nucleic acid molecules
are arranged to permit the regulatory nucleic acid
molecule to regulate transcription of the reporter nucleic
acid molecule.

In one embodiment, the regulatory nucleic acid molecule
comprises the sequence shown in SEQ ID NO: 3.

In another embodiment, the regulatory nucleic acid
molecule has the sequence of any one of the following
fragments of the CYP3A4 gene:

(i) a fragment consisting of from nucleotide positions
-13,000 to +53;
(ii) a fragment consisting of from nucleotide positions
-13,000 to -12,700 contiguous with -8000 to +53;
(iii) a fragment consisting of from nucleotide positions
-13,000 to -5,100 contiguous with -1,200 to +53;
(v) a fragment consisting of from nucleotide positions
-7,800 to -6,000 contiguous with -362 to +53;
(vi) a fragment consisting of from nucleotide positions
-7,500 to -6,000 contiguous with -362 to +53.

A regulatory nucleic acid molecule which has the sequence
of a fragment consisting of from nucleotide positions
-7836 to -7207 contiguous with -362 to +53 is particularly
preferred, as this construct contains the minimal
sequences necessary for regulating transcription of the
human CYP3A4 gene, more specifically, an element
responsive to xenobiotics (the "Xenobiotic Response
Element Module" or "XREM") and the proximal promoter of
the CYP3A4 gene.
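
As an illustration only, the fragment coordinates given above can be
represented as segments relative to the initiation of transcription
site and their lengths added up. The sketch below is not part of the
described method. It assumes the common numbering convention in which
there is no position 0 (position -1 is immediately followed by +1) and
that both end positions of a segment are included; the function and
variable names are hypothetical.

```python
from typing import List, Tuple

# A segment is (upstream_position, downstream_position), both relative to
# the initiation of transcription site and both inclusive.
Segment = Tuple[int, int]


def segment_length(start: int, end: int) -> int:
    """Number of nucleotides from start to end inclusive.

    Assumes a numbering scheme with no position 0 (..., -2, -1, +1, +2, ...).
    """
    if start == 0 or end == 0:
        raise ValueError("position 0 is not used in this numbering convention")
    if start > end:
        raise ValueError("start must not be downstream of end")
    length = end - start + 1
    if start < 0 < end:
        # The span crosses the -1/+1 boundary, so remove the absent position 0.
        length -= 1
    return length


def construct_length(segments: List[Segment]) -> int:
    """Total length of a construct made of contiguous (joined) segments."""
    return sum(segment_length(start, end) for start, end in segments)


# The particularly preferred fragment: -7836 to -7207 contiguous with -362 to +53.
XREM_PLUS_PROMOTER: List[Segment] = [(-7836, -7207), (-362, 53)]

if __name__ == "__main__":
    for start, end in XREM_PLUS_PROMOTER:
        print(f"{start:+d} to {end:+d}: {segment_length(start, end)} nt")
    print("total:", construct_length(XREM_PLUS_PROMOTER), "nt")
```

Under these assumptions the distal segment is 630 nucleotides and the
proximal promoter segment is 415 nucleotides. A different numbering
convention would shift these values by one.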

The regulatory nucleic acid molecule of the invention
typically contains at least one enhancer capable of
regulating transcription of a human CYP3A4 gene when
contacted with a nuclear receptor. Examples of such
enhancers are those capable of regulating transcription of
a human CYP3A4 gene when contacted with a nuclear receptor
bound to a ligand, such as a xenobiotic or steroid. Other
examples are those capable of regulating transcription of
a human CYP3A4 gene when contacted with a nuclear receptor
consisting of a heterodimer of PXR (pregnane X receptor,
otherwise known as SXR (steroid and xenobiotic receptor))
and RXR (9-cis retinoic acid receptor), or CAR
(constitutive androstane receptor-β) and RXR.

The inventors believe that certain nucleic acid molecules
which have substantially the same nucleotide sequence as a
regulatory nucleic acid molecule of the invention would
also have sufficient genetic information for reproducing
the constitutive and xenobiotic induced tissue specific
and developmental expression of the CYP3A4 gene that is
observed in a human. Accordingly, it will be understood
that nucleotides could be modified or deleted in regions
of the regulatory nucleic acid molecule, more
specifically, those regions which do not contain an
enhancer such as those described above, without
significantly limiting the capacity of the molecule to
regulate transcription of the human CYP3A4 gene.

The inventors recognise that it would be advantageous to
provide an animal model further capable of reproducing the
expression of other human genes, specifically those genes
encoding products which modify or modulate the therapeutic
activity of exogenous and endogenous compounds used for
therapy and cause drug-drug interactions, for example,
cytochrome P450 genes or ABC transporter superfamily
genes, for example, P-glycoprotein (otherwise known as
MDR-1). The regions controlling the constitutive and
xenobiotic induced tissue specific expression of some of
these genes are known, and in some instances, non-human
animal models have been generated. The inventors recognise
that the genetic background of these animals could be
incorporated into the non-human mammal of the present
invention, for example, by conventional breeding
techniques.

Thus in a fifth aspect, the invention provides a non-human
mammal of any one of the first to fourth aspects of the
invention, further comprising:

(c) a further regulatory nucleic acid molecule which is
capable of regulating transcription of a human gene; and

(d) a further reporter nucleic acid molecule for
producing a detectable amount of a further reporter
molecule for indicating regulation of transcription of the
further reporter nucleic acid molecule by the further
regulatory nucleic acid molecule

wherein the further reporter and further regulatory
nucleic acid molecules are arranged to permit the further
regulatory nucleic acid molecule to regulate transcription
of the further reporter nucleic acid molecule.

In one embodiment, the at least one further regulatory
nucleic acid molecule has a sequence shown in SEQ ID NO: 4.
In another embodiment, the at least one further regulatory
nucleic acid molecule has a sequence shown in SEQ ID NO: 5.

Although the regulatory nucleic acid molecule of the
invention described herein is sufficient for reproducing
the constitutive tissue specific and developmental
expression of the CYP3A4 gene that is observed in a human,
the inventors recognise that aspects of the xenobiotic
inducibility of the gene could be better reproduced in an
animal by incorporating at least one human transcription
factor that is capable of interacting with the regulatory
nucleic acid molecule for regulating transcription of the
human CYP3A4 gene. Examples of such factors are nuclear
receptors. These receptors may be those capable of
regulating CYP3A4 gene transcription in a human when the
receptor is bound to a ligand, such as a xenobiotic or
steroid. One example of such a receptor is the human PXR
(pregnane X receptor, otherwise known as SXR (steroid and
xenobiotic receptor)). Another suitable receptor is the
human CAR (constitutive androstane receptor-β). Non-human
animals comprising a human PXR or CAR receptor are known.
The inventors recognise that the genetic background of
these animals could be incorporated into the non-human
mammal of the present invention, for example, by
conventional breeding techniques.

Thus in a sixth aspect, the non-human animal of the
invention further comprises at least one human
transcription factor for regulating transcription of a
human CYP3A4 gene. Preferably the transcription factor is
a nuclear receptor. Preferably, the nuclear receptor is a
heterodimer of the human PXR (pregnane X receptor,
otherwise known as SXR (steroid and xenobiotic receptor))
and human RXR (9-cis retinoic acid receptor) or human CAR
(constitutive androstane receptor-β) and human RXR.

It follows that the reporter nucleic acid molecule can be
any molecule which is capable of detection when the
reporter nucleic acid molecule is transcribed. For
example, the reporter nucleic acid molecule could be the
CYP3A4 cytochrome, or the mRNA transcript which is
translated to produce the cytochrome. Those reporter
molecules which are commercially available, including
firefly luciferase, β-galactosidase, alkaline phosphatase,
green fluorescent protein or chloramphenicol acetyl
transferase can be used.

Thus in one embodiment, the reporter nucleic acid molecule
is capable of producing a reporter molecule selected from
the group of reporter molecules consisting of firefly
luciferase, β-galactosidase, alkaline phosphatase, green
fluorescent protein or chloramphenicol acetyl transferase.

While the non-human mammal of the invention, as
exemplified below, is a mouse, the inventors believe that
any other non-human mammal could be used in the invention,
especially those for which standard transgenic techniques
have been developed including, for example, rat and rabbit.
However, typically the non-human mammal is a mouse.

In another aspect, the invention provides a tissue of a
non-human mammal of the invention.

In one embodiment, the tissue is an embryo capable of
producing a non-human mammal of the invention.

In a further aspect, the invention provides a method of
determining whether a compound is capable of effecting the
transcription of a human CYP3A4 gene, the method comprising
the following steps:

(a) administering the compound to a non-human mammal
according to the invention; and

(b) determining whether the reporter molecule is produced
by the reporter nucleic acid molecule in the mammal.

In one embodiment, the production of the reporter molecule
indicates that the binding compound is capable of
effecting the transcription of the human CYP3A4 gene.

Any compound can be tested in the method; however,
preferred compounds are xenobiotic or steroid compounds.

The inventors recognise that a non-human animal which
comprises a 5' flanking region of the CYP3A4 gene but which is
deficient for the region from -7836 to -7207 would be
useful as a negative control in a method for determining
whether a compound is capable of regulating transcription
of the human CYP3A4 gene.



BRIEF DESCRIPTION OF THE FIGURES 
Figure 1. CYP3A4/lacZ transgene constructs used to generate
transgenic mice. The upstream regions of the human CYP3A4
gene are depicted as open boxes with the position of the
XREM at approximately -7.5kb of the CYP3A4 gene indicated
by cross-hatching. The 5'-flanking region extended from
56bp downstream of the transcription initiation site to a
HindIII site at -3,213 in the construct designated
-3CYP3A4/lacZ and to a KpnI site at -12,926 in construct
-13CYP3A4/lacZ. The coding region of the E. coli lacZ gene
together with eukaryotic translational initiation and
termination signals, transcription termination and
polyadenylation sites are indicated by a solid box.

Figure 2. Xenobiotic induction of hepatic transgene
expression. Female mice from line 9/4 harbouring the
-13CYP3A4/lacZ transgene were treated with various reagents.
Histochemical staining of liver slices with X-gal revealed
an increased zone of blue staining cells containing
β-galactosidase after treatment with rifampicin,
phenobarbital and pregnenolone 16α-carbonitrile compared
with corn oil treated mice.

Figure 3. Comparison of the xenobiotic induction profile
of the -13CYP3A4/lacZ transgene with the mouse Cyp3a11
gene. Transgenic mice from line 9/4 were treated with a
range of xenobiotic reagents and naturally occurring
steroids. A. Transgene expression was assessed by
determining β-galactosidase activity in total liver lysates
using the ONPG assay. The units of β-galactosidase activity
are given as A420/mg liver/minute. Dexamethasone and
pregnenolone 16α-carbonitrile were the most potent
xenobiotic activators of the -13CYP3A4/lacZ transgene,
while rifampicin treatment resulted in relatively low
levels. The steroids pregnenolone and 17α-progesterone
were very weak inducers. B. Hepatic expression of the
endogenous mouse Cyp3a11 gene was examined in the same mice
by Northern analysis. A similar pattern of induction to
the CYP3A4/lacZ transgene was observed with both xenobiotic
and endogenous regulators. The data are presented as the
mean +/- the standard deviation for 3 animals.

Figure 4. Dose response of -13CYP3A4/lacZ transgene
expression after treatment with dexamethasone. A. Male
mice from line 9/4 were treated with from 1 to 100mg/kg
dexamethasone. Higher doses of dexamethasone resulted in
increased β-galactosidase activity (determined in liver
lysates as described in Fig. 3). B. Zonal expansion of
transgene expression with increasing doses of
dexamethasone. X-gal staining of frozen liver sections
revealed greater numbers of hepatocytes containing
transgene-derived β-galactosidase activity after treatment
with 1, 10 and 100 mg/kg dexamethasone. At low doses there
are limited numbers of transgene-expressing cells
immediately adjacent to the central vein. With higher
doses there are more cells committed to transgene
expression extending across the liver lobule towards the
portal tract.

Figure 5. (SEQ ID NO: 1) Sequence of the CYP3A4 5'-flanking
region included in the -13CYP3A4/lacZ construct. This
sequence corresponds to -12,926 to +56 base pairs relative
to the transcription initiation site of the CYP3A4 gene.

Figure 6. (SEQ ID NO: 2) Sequence of the 5'-flanking region
of the CYP3A4 gene extending from -12,926 to -3,213 base
pairs and representing the difference in sequence between
the -13CYP3A4/lacZ and the -3CYP3A4/lacZ constructs.

Figure 7. (SEQ ID NO: 3) The "Xenobiotic-Responsive
Enhancer Module" (XREM) of the human CYP3A4 gene. This
region encompasses -7836 to -7207 base pairs relative to
the transcription initiation site of the CYP3A4 gene.

Figure 8. (SEQ ID NO: 4) The 5'-flanking region of the
human CYP3A7 gene (Genbank Accession No. AF329900). The
extent of the sequences is -11,133 to +52 base pairs relative to
the transcription initiation site of the CYP3A7 gene.

Figure 9. (SEQ ID NO: 5) Sequence of the 5'-flanking region
of the human MDR1 gene (P-glycoprotein gene) encompassing
-10,000 to +200 base pairs relative to the transcription
initiation site of the MDR1 gene. Sequence derived from
within Genbank sequence Accession Number AC002457.

An embodiment of the invention is now described in the
following Example which will be understood to merely
exemplify and not to limit the scope of the invention.

EXAMPLE 

MATERIALS AND METHODS 

Transgene constructs. Two transgene constructs were
synthesized with the upstream 5' flank of the human
cytochrome P450 CYP3A4 gene linked to the E. coli lacZ
reporter gene (Figure 1). The first construct, designated
-3CYP3A4/lacZ, contained the region of the CYP3A4 gene from
the HindIII site at -3213bp relative to the transcription
start site to nucleotide +56bp downstream of the
transcription start site. The other construct, designated
-13CYP3A4/lacZ, included the region of the CYP3A4 gene from
the KpnI site at -12,926bp upstream to +56bp downstream of
the transcription start site. It includes the DNA
sequences of the XREM region located between -7836 and
-7208 in addition to the proximal promoter of the CYP3A4
gene. The DNA sequence of the CYP3A4 gene between -10468bp
and +906bp has been determined and deposited with the
GenBank/EMBL/DDBJ database under accession number AF185589.
Additional sequence information covering the region
-10,469bp to -12,926bp was obtained from publicly
accessible Genbank files. The E. coli lacZ reporter gene
comprises the coding region for the bacterial enzyme
β-galactosidase flanked by DNA sequences for eukaryotic
translational start and stop signals, SV40 transcriptional
termination and polyadenylation signals and an intron. The
CYP3A4/lacZ transgene constructs were released from vector
sequences and purified on agarose gels prior to
microinjection.

Generation of transgenic mouse lines. Mice carrying the
CYP3A4/lacZ transgenes were created by microinjection of
the DNA constructs into the pro-nuclei of zygotes harvested
from FVB/N strain mice. Microinjection and manipulation of
embryos were carried out by standard techniques. Stable
transgenic mouse lines were established by breeding from
transgenic founders identified by Southern analysis.

Administration of xenobiotics to mice. 8-10 week old male
and female mice hemizygous for the -3CYP3A4/lacZ and
-13CYP3A4/lacZ transgenes were used to test the ability of a
range of xenobiotics and hormones to activate expression of
transgene-derived β-galactosidase. Mice were administered
the following reagents and vehicles by single daily
intraperitoneal injection for 4 days: rifampicin/corn oil;
dexamethasone phosphate/H2O; pregnenolone
16α-carbonitrile/2% Tween 20 in H2O; phenobarbital/H2O;
clotrimazole/2% Tween 20; phenytoin/2% Tween 20; 17α-OH
progesterone/2% Tween 20; pregnenolone/2% Tween 20. All
reagents were supplied by Sigma Chemical Co. (St Louis, MO)
except for dexamethasone phosphate which was obtained from
Faulding (Mulgrave, Australia) and pregnenolone
16α-carbonitrile [...]
for all reagents to test for induction of the transgene was
100mg/kg body weight. Dose response studies were carried
out in the range of 1-100mg/kg with male hemizygous
transgenic mice.
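
For illustration, the arithmetic behind a weight-based dose such as
100mg/kg can be sketched as follows. This is a minimal example and not
the procedure used in the Example: the body weight and the stock
concentration are hypothetical values chosen only to show the
calculation, and the function names are assumptions.

```python
def dose_mg(dose_mg_per_kg: float, body_weight_g: float) -> float:
    """Amount of compound (mg) for one animal at a given mg/kg dose."""
    if dose_mg_per_kg < 0 or body_weight_g <= 0:
        raise ValueError("dose must be non-negative and body weight positive")
    return dose_mg_per_kg * body_weight_g / 1000.0


def injection_volume_ul(dose_mg_per_kg: float,
                        body_weight_g: float,
                        stock_mg_per_ml: float) -> float:
    """Volume (microlitres) of a stock solution that delivers the dose."""
    if stock_mg_per_ml <= 0:
        raise ValueError("stock concentration must be positive")
    return dose_mg(dose_mg_per_kg, body_weight_g) / stock_mg_per_ml * 1000.0


if __name__ == "__main__":
    # Hypothetical 25 g mouse and hypothetical 25 mg/ml stock solution.
    for dose in (1, 10, 100):
        amount = dose_mg(dose, 25.0)
        volume = injection_volume_ul(dose, 25.0, 25.0)
        print(f"{dose} mg/kg: {amount:.3f} mg in {volume:.1f} ul")
```

In practice the stock concentration would be chosen so that the volume
injected stays within a practical range across the whole dose series.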

Analysis of transgene and mouse Cyp3a gene expression.
β-galactosidase activity was visualised in slices and frozen
sections of liver and other tissues by staining with X-gal
(5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside).
Tissues were fixed in 0.25% glutaraldehyde, 0.1M phosphate
buffer pH7.3, 5mM EGTA, 2mM MgCl2; washed in 0.1M
phosphate buffer pH7.3, 0.01% sodium deoxycholate, 0.025%
NP40, 2mM MgCl2 and stained by incubation at 37°C in wash
solution supplemented with 1mg/ml X-gal, 5mM potassium
ferricyanide, and 5mM potassium ferrocyanide. The level of
β-galactosidase activity was determined in whole liver
homogenates [100mg fresh tissue/ml 0.25M Tris-HCl (pH 7.3)]
using the O-nitrophenyl-β-D-galactopyranoside (ONPG) assay
according to standard techniques. After appropriate
dilution the homogenate was incubated with β-galactosidase
assay reagent (0.1M sodium phosphate buffer (pH7.3)/1mM
MgCl2/50 mmol β-mercaptoethanol/0.88mg/ml ONPG) at 37°C,
quenched by the addition of 1M Na2CO3 and the absorbance at
420nm determined. The units of β-galactosidase activity
are given as A420/mg liver/minute.
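
The unit A420/mg liver/minute can be made concrete with a short
calculation. The sketch below is illustrative only: it takes the
homogenate concentration of 100mg fresh tissue/ml from the text, but
the homogenate volume, dilution factor, incubation time, blank reading
and function names are hypothetical assumptions and not values stated
in the Example.

```python
def liver_mg_in_assay(homogenate_ul: float,
                      dilution_factor: float = 1.0,
                      tissue_mg_per_ml: float = 100.0) -> float:
    """Mass of liver (mg) represented by the diluted homogenate in the assay.

    tissue_mg_per_ml defaults to the 100 mg fresh tissue/ml homogenate
    described in the text; the other arguments are assumptions.
    """
    if homogenate_ul <= 0 or dilution_factor <= 0 or tissue_mg_per_ml <= 0:
        raise ValueError("all quantities must be positive")
    return tissue_mg_per_ml * (homogenate_ul / 1000.0) / dilution_factor


def bgal_activity(a420: float,
                  liver_mg: float,
                  minutes: float,
                  blank_a420: float = 0.0) -> float:
    """Beta-galactosidase activity in A420 per mg liver per minute."""
    if liver_mg <= 0 or minutes <= 0:
        raise ValueError("liver mass and incubation time must be positive")
    return (a420 - blank_a420) / (liver_mg * minutes)


if __name__ == "__main__":
    # Hypothetical reading: 10 ul of a 1:10 dilution, 30 minute incubation.
    mass = liver_mg_in_assay(homogenate_ul=10.0, dilution_factor=10.0)
    activity = bgal_activity(a420=0.65, liver_mg=mass, minutes=30.0,
                             blank_a420=0.05)
    print(f"{activity:.3f} A420/mg liver/minute")
```

Subtracting a reagent blank is a common precaution in absorbance
assays; whether one was used here is not stated, so the default blank
is zero.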

The levels of endogenous mouse Cyp3a mRNA expression were
determined by Northern analysis using a riboprobe
complementary to nucleotides 852-1051 of the mouse Cyp3a11
cDNA. Filters were stripped and reprobed with an 18S rRNA
oligonucleotide to normalise loading.

RESULTS

4 transgenic lines were generated with the construct
containing the -3.2kb region of the human CYP3A4 gene
linked to lacZ. Transgene-derived β-galactosidase activity
was not detected in kidney, large and small intestine,
spleen, lung and liver tissue from mice for all 4
-3CYP3A4/lacZ transgenic lines treated with vehicle or
xenobiotics (Table 1). In contrast, transgene expression
was readily detected in 3 of the 4 lines carrying the
-13CYP3A4/lacZ construct. Line 9/4 had a very low
constitutive level in the liver, with β-galactosidase
detected only in isolated hepatocytes adjacent to major
blood vessels. Administration of xenobiotics resulted in
robust expression in a zone of cells surrounding the
central vein (Figure 2). As the basal level of transgene
expression in untreated mice in line 9/4 is extremely low,
induction is obvious and is essentially an off/on process.
Expression in other tissues in mice from line 9/4 was
restricted to the gut, predominantly in the villi of the
small intestine.

The relative degree of induction for a range of xenobiotics
was analysed by determining the transgenic β-galactosidase
activity in liver lysates of mice from line 9/4 (Figure
3A). Dexamethasone and pregnenolone 16α-carbonitrile were
the most potent inducers, while rifampicin activated the
transgene to relatively modest levels. Phenobarbital,
clotrimazole and phenytoin were intermediate inducers. The
induction profile of the transgene in line 9/4 was similar
to that observed for the endogenous Cyp3a11 gene in the
same mice (Fig 3B), likely reflecting the activation
profile of the mouse rather than the human PXR. Activation
of the transgene was observed with naturally occurring
steroids such as pregnenolone and 17α-progesterone; however,
the induction was weak compared with xenobiotics.
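
A relative degree of induction of this kind is commonly summarised as
the mean activity of a treated group, with its standard deviation,
divided by the mean activity of the vehicle group. The sketch below
shows that calculation under stated assumptions: the numbers are
invented placeholders for groups of 3 animals and are not data from
this study, and the function names are hypothetical.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple


def summarise(values: Sequence[float]) -> Tuple[float, float]:
    """Return (mean, sample standard deviation) for one treatment group."""
    if len(values) < 2:
        raise ValueError("at least two animals are needed for a standard deviation")
    return mean(values), stdev(values)


def fold_induction(treated: Sequence[float], vehicle: Sequence[float]) -> float:
    """Mean treated activity divided by mean vehicle activity."""
    baseline = mean(vehicle)
    if baseline <= 0:
        raise ValueError("vehicle mean must be positive to compute a ratio")
    return mean(treated) / baseline


if __name__ == "__main__":
    # Placeholder activities (A420/mg liver/minute), 3 animals per group.
    vehicle = [0.5, 0.5, 0.5]
    treated = [4.0, 5.0, 6.0]
    m, sd = summarise(treated)
    print(f"treated: {m:.2f} +/- {sd:.2f}")
    print(f"fold induction: {fold_induction(treated, vehicle):.1f}")
```

When the vehicle level is close to the detection limit, as described
for line 9/4, a ratio becomes unstable and absolute activities may be
the more informative summary.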
There was a marked gender difference in hepatic transgene
expression, with lower levels observed in females than in
males for most reagents. Such a male-predominant pattern
was not evident in the induction profile of the mouse
Cyp3a11 gene. Indeed higher levels of Cyp3a11 mRNA were
observed in females than males after treatment with
rifampicin and pregnenolone 16α-carbonitrile. The reason
for this apparent reversal in gender-related transgene
expression pattern is not known. However, as Cyp3a11 mRNA
is only just detectable in males of the FVB/N strain of
mice, it may be attributed to the relatively greater degree
of induction of the mouse Cyp3a11 gene in males compared to
females (Figure 3B).

The other line which showed significant transgene
expression, 15/10, had a higher constitutive level in both
the liver and small intestine in untreated mice.
Expression was not detected in other organs, confirming the
tissue specificity observed in line 9/4. The same set of
reagents was capable of increasing hepatic and intestinal
transgene expression to the same levels as in mice from
line 9/4. However, the overall degree of induction was not
as great as observed in line 9/4 due to the higher basal
level in line 15/10. The induction profile was similar,
with dexamethasone being the most potent activator and
rifampicin the least (data not shown).
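
The effect of basal level on the apparent degree of induction
can be illustrated with simple arithmetic. The following is a
minimal sketch only: the function name and the activity values
are hypothetical placeholders chosen for illustration and are
not measurements from lines 9/4 or 15/10.

```python
def fold_induction(induced: float, basal: float) -> float:
    """Return induced activity divided by basal activity.

    Both values are reporter activities in the same arbitrary units.
    """
    if basal <= 0:
        raise ValueError("basal activity must be positive")
    return induced / basal


# Hypothetical placeholder values (arbitrary units), NOT data from this work.
# Both lines reach the same induced level but differ in basal level.
low_basal_line = {"basal": 1.0, "induced": 50.0}
high_basal_line = {"basal": 10.0, "induced": 50.0}

low_fold = fold_induction(low_basal_line["induced"], low_basal_line["basal"])
high_fold = fold_induction(high_basal_line["induced"], high_basal_line["basal"])

# The line with the higher basal level shows the smaller fold induction
# even though the induced level is identical.
print(low_fold, high_fold)
```

Under these assumed values, the same induced level yields a
tenfold smaller ratio in the line with the higher basal level,
which is the relationship described above for line 15/10.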

Dose response of xenobiotic induction. The activation of
transgene expression in line 9/4 by dexamethasone was
dose-dependent over the range 1 to 100 mg/kg (Figure 4A). The
higher transgene-derived β-galactosidase activity in liver
homogenates from mice treated with increasing doses of
dexamethasone was associated with an expanded zone of cells
which were stained by X-gal. At low doses of dexamethasone
a ring of hepatocytes only 1-2 cells thick around the
central vein expressed the transgene (Figure 4B). With
100 mg/kg dexamethasone the zone of X-gal positive
hepatocytes increased to up to 10 cells, approximately
midway between the central vein and the portal triad. A
similar dose-dependent expansion of hepatocytes expressing
the transgene was observed with other reagents and also in
line 15/10, which also contained the -13CYP3A4/lacZ
construct.
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CLAIMS 

1. A non-human mammal comprising:

(a) a regulatory nucleic acid molecule which is
capable of regulating transcription of the human
CYP3A4 gene and which comprises a nucleotide sequence
that is identical to a sequence of the human CYP3A4
gene located between the initiation of transcription
site of the gene and a position located at least
13,000 nucleotides upstream from the site; and

(b) a reporter nucleic acid molecule for producing
a detectable amount of a reporter molecule for
indicating regulation of transcription of the
reporter nucleic acid molecule by the regulatory
nucleic acid molecule;

wherein the reporter and regulatory nucleic acid
molecules are arranged to permit the regulatory
nucleic acid molecule to regulate transcription of
the reporter nucleic acid molecule.

2. A non-human mammal comprising:

(a) a regulatory nucleic acid molecule comprising a
nucleotide sequence that is identical to the
nucleotide sequence of the human CYP3A4 gene that
extends about 13,000 nucleotides upstream from the
initiation of transcription site of the gene; and

(b) a reporter nucleic acid molecule for producing
a detectable amount of a reporter molecule for
indicating regulation of transcription of the
reporter nucleic acid molecule by the regulatory
nucleic acid molecule;

wherein the reporter and regulatory nucleic acid
molecules are arranged to permit the regulatory
nucleic acid molecule to regulate transcription of
the reporter nucleic acid molecule.

3. A mammal according to claim 2 wherein the regulatory
nucleic acid molecule comprises the sequence shown in
SEQ ID NO:1.

4. A non-human mammal comprising:

(a) a regulatory nucleic acid molecule comprising a
nucleotide sequence that is identical to the sequence
of the human CYP3A4 gene that extends about 8,000
nucleotides upstream from a position about 3,000
nucleotides upstream from the initiation of
transcription site of the gene; and

(b) a reporter nucleic acid molecule for producing
a detectable amount of a reporter molecule for
indicating regulation of transcription of the
reporter nucleic acid molecule by the regulatory
nucleic acid molecule;

wherein the reporter and regulatory nucleic acid
molecules are arranged to permit the regulatory
nucleic acid molecule to regulate transcription of
the reporter nucleic acid molecule.

5. A mammal according to claim 4 wherein the regulatory
nucleic acid molecule comprises the sequence shown in
SEQ ID NO:2.

6. A non-human mammal comprising:

(a) a regulatory nucleic acid molecule which is
capable of regulating transcription of the human
CYP3A4 gene and which comprises a nucleotide sequence
that is identical to the sequence of the human CYP3A4
gene that extends about 600 nucleotides upstream from
a position about 7,200 nucleotides upstream of the
initiation of transcription site of the gene; and

(b) a reporter nucleic acid molecule for producing
a detectable amount of a reporter molecule for
indicating regulation of transcription of the
reporter nucleic acid molecule by the regulatory
nucleic acid molecule;

wherein the reporter and regulatory nucleic acid
molecules are arranged to permit the regulatory
nucleic acid molecule to regulate transcription of
the reporter nucleic acid molecule.

7. A mammal according to claim 6 wherein the regulatory
nucleic acid molecule comprises the sequence shown in
SEQ ID NO:3.

8. A mammal according to any one of the preceding claims
wherein the regulatory nucleic acid molecule has the
sequence of a fragment of the CYP3A4 gene consisting
of nucleotide positions -7836 to -7207
contiguous with -362 to +53.

9. A mammal according to any one of the preceding
claims, further comprising:

(c) a further regulatory nucleic acid molecule
which is capable of regulating transcription of a
human gene; and

(d) a further reporter nucleic acid molecule for
producing a detectable amount of a further reporter
molecule for indicating regulation of transcription
of the further reporter nucleic acid molecule by the
further regulatory nucleic acid molecule;

wherein the further reporter and further regulatory
nucleic acid molecules are arranged to permit the
further regulatory nucleic acid molecule to regulate
transcription of the further reporter nucleic acid
molecule.

10. A mammal according to claim 9 wherein the at least
one further regulatory nucleic acid molecule has a
sequence shown in SEQ ID NO:4.

11. A mammal according to claim 9 wherein the at least
one further regulatory nucleic acid molecule has a
sequence shown in SEQ ID NO:5.

12. A mammal according to any one of the preceding
claims, further comprising at least one human
transcription factor for regulating transcription of
a human CYP3A4 gene.

13. A mammal according to claim 12 wherein the
transcription factor is a nuclear receptor.

14. A mammal according to claim 13 wherein the nuclear
receptor is a heterodimer of the human pregnane X
receptor and human 9-cis retinoic acid receptor or a
heterodimer of human constitutive androstane
receptor-β and human 9-cis retinoic acid receptor.

15. A mammal according to any one of the preceding claims
wherein the reporter nucleic acid molecule is capable
of producing a reporter molecule selected from the
group of reporter molecules consisting of firefly
luciferase, β-galactosidase, alkaline phosphatase,
green fluorescent protein or chloramphenicol acetyl
transferase.

16. A mammal according to any one of the preceding claims 
wherein the mammal is a mouse. 

17. A tissue of a mammal according to any one of the
preceding claims.

18. A tissue according to claim 17 wherein the tissue is
an embryo capable of producing a mammal according to
any one of the preceding claims.

19. A method of determining whether a compound is capable
of effecting the transcription of a human CYP3A4 gene,
the method comprising the following steps:

(a) administering the compound to a non-human
mammal according to any one of the preceding
claims; and

(b) determining whether the reporter molecule is
produced by the reporter nucleic acid molecule
in the mammal.

20. A method according to claim 19 wherein the production
of the reporter molecule indicates that the binding
compound is capable of effecting the transcription of
the human CYP3A4 gene.

Fig 1. Human CYP3A4/lacZ transgene constructs

Fig 3B Xenobiotic induction of mouse CYP3A11 mRNA

Figure 5.

CTGGTTCATCTCATTGGGACTGGTTGGACAAGAGGGTGCAGCCCACGGAGGGTGAGCCAAAGCAGGGTGGG 
GCGTCGCCTCACCTGGGAAGCACAAGGGGTCGTGGAATTTTCTCCCCTACCCAAGGAAAGCCATAAGGGAC 
TGAGCCTGAGGAACTGTGCACTCTGGCCCAGATACTGCACTTTTCCCATGGTCTTTGCAACCCGCAGACCA
GGAGATTCCCTCCGGTGCCTATGCCACCAGGGCCCTGGGTTTCAAGCACAAAACTGGGCAGCCATTTGGGC 
AGACACCGAACTAGCTGCAGGAGTTTTTTTTTTTTTTTTCCATACCCCATTGGCACCTGGAACGCCAGTGA 
GACAGAACCGTTCACTCCCCTGGAAAGGGGGCTGAAACCAGGGATCCAAGTGGTCTGGCTCGGTGGGCCCC 
ACCCCCATGGAGCCCAGCAAACAAAGATTCACTTGGCTTGAAATTCTTGCTGCCAGCACAGCAGCAGTCTG 

AGATTGACCTGGGACCCTCGAACTTGGTTGGGTGCTGTGGGGGGGCATCTTCCATTGCTGAGGCTTGAGTA
GGTGGTTTTACCTTCGCGGTGTAAACAAAGCTGCTGGGAAGTTTGAACTGGGTGGAGCTCACCACAGCTCA 
GTAAGGCCACTGTGdCCAGACTGCCTCTCTGGATTTCTCCTCTCTGGGAAGGATATCTCTGAAAAAAAGGC 
AGCAGCCCCAGTCAGGGACTTATAGATGAAACCCCCATCTCCCTGGGACAGAGCCCCTCGGGGAAGAGGTG 
GCTTCCACCATTGTGGAAGACTGTGTGGCAATTCCTCACGGATTTAGAACTAGAGATACCATTTGACCCAG 

CAATCCCATTACTGGGTGTATACCCATAGGATTATAAATCATTCTACTATAAAGACACATGCACACTTATG
TTTATTGTAACACTATTTACAATAGCAATGACCTGGAACCAATCCAAAAGCCCATCAATGATAGACTGAAT 
AAAGAAAATGTGGCACATATACACTGTGGAATACTATGCAGCCATAAAAAAGGATGAGTTCATGTCCTTTG 
CAGAGACATGGATGAAGCTGGAAACCATCiyTTCTCAGCAAACTAGCACAA 

ATGTTGTCACTCATAAGTGGGAGTTAAACAATGAGAACACATGGACACAGGGAGGGGAACGTCACACACTG 

GGGCATGTCGGGGAGTGGGGGCCTACGGGAGGGATAGCATTAGCAGAAATACCTAATGTAGGTGACGGGTT

GATGGGTGCAGCAAACCACCATGGCACATATACACCTATGTAATAAAACTGCACGTTCTGCLAC^ 
CAGAACTT^AAGTATAATTAATAATAATAATAATTTCTGGGCATGTAAGTAGCTGTCTTTCAGGTTCTACT 
TTGATACATATTCTGAGAGAATTAAACCTGTCAAAGAAACCTTGACTTTCAATGGCA.GGCACTGGAATTGA 
CCCTAATA^.TGTGTTTTGQGGTAAGCCTACTCATATTCTCAACCTGTCTGCAGTAGTCGTTAGAATCTGAA 
CTTCCTGAAGTTCATGTGCAAAGTTGAGTTAATTGTTTAAT^

AGGAAAATATTAGATATGTGTCATCACTGCTGGTATTATTTAAACTGCAACATATTTTAGCTGGCTGCTGA 
TCTCAGCCACCATGCCTGCATTTTATCTCTGTCTCGTGG 

TAGAATCCTGGGCATCAAGAACATGTGGTTCTAATGGCTAGATAGGGAATGAGAGTAAAAGGATTTTGC'CC 
ACGGTCACGTGAGTAAACAACAGATTTGGAGGGGTCTGGACTACTGTGATGACTTCATTCTGACAATATGT 

TCCAGTTGTCCTTTCATTTCCTCCTAATCACATGTCTGGTCTGATCTGGCTGTTTCCCACCTTCCAATTCC

TGCCTTCTCCAATGCTCCCTTCCGTAGGTCACTCTGTGGCTCAGAGACCCTGCTTAGCAAGCGCCCAACCT 
TTCAATTATTTGTTCAGTAAAACTTGAACTCATGTCTCCCCTTCTTGATAAAAAGAAAATACGTTATGTAA 
TGTCGGGTTACTCTATAACTCTTGTCCTGTCTCTCGGCAACTACTGAACTAACTGTTTTCATATTGAGCAA 
ACGTTTATGGAAGGACTGCCAAGAGTCAGGTACTAGGCTTGGTAATATTCCCCGTTCTCTCTAGTCAAAGC 

CAACACCAGCCAGACTTGCAGATCTAGGTCCCAAGCCCACTGCAGATCACAGGCCAGGGTCTGGTCTCCTC

TGAGCTCCTTTGGGAGGGAAAGACAGAATTATTAACACCCATTTTGTAGATTAGGCAACTGAGGCTGAGGA 
AGTTT^vA^.T^CTCAGACAGGGCCTGCACGTCAGTCATATTCCAAGGATCCCTACTCACTGTCTTCTCTCT 
AOTiGAACGAGATGTCTCTGGAGTCCATAGAAAGCCCAGGAGCCTGGCTGGGCACGGTGGCTCCTGCCTGTA 
ATCCCAGChCTTTGGGAGGCCGAGGCAGGCAGATCACCTGAGCTCAGGAGTTCAAGACCAGCCTGGGCAAC 

atggcaaaaccccatctctactaaaaatacaa^
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GCTACTTGGGAGGCTGAGGCACAAGAATTGCTTGAGCCCAGGAGGCAGCAGTTGCAGTGAGC7GAGATTGT 

GCCAGTGCACTCCAGCCTGGGCAACAGAGCAAGATTCCATTTCAAAAACAAAAACAAACACAAACAAACAA 

ACAAAAATAGAAAGCCCAGGGACCACCTGCGTCAGGTTCCCAGCCACACCTTTTTCTTGTCCTCCTCTGTC 

TCTGGCATCTTCTCACAGGTTCCTAATTGTTTGTGGTTGCACAAATTCAAAATCCCAGAAAAATTACCACT 

TCACACCCACTCAGATGGCTATTTTTTTTTTGAAGGAAGATAACAAGTGTTGACAAGAACATGGAGAAATT 

GGAATTCTCACCCATTGCTGGTGAGAATGTAATACGGTGCTGCTGCTATGGAAAACAGCTTGGAGTTTCCT 

CAAAAAGTTCAACAGAATTTCAATGTGACCCAGCAATTCCCCTCTAAGTTATAGATCTGAGAGGATTAAAA 

ACAGTTACTAAAATACACGGACTCACATATTTCTAACAGTCCAATTCACAAGGGCCAAAAGGTGCTAATAG^ 

CCCACATGTCCATCGATGGATGGATAAATAAATTGTGGTCTATCCATACAATGGAATATTATTCGGCCATA 

AATGGAATGAAGTACTGACGCATGCTACAGAATGGATGAACCGCAAAAAAAATGGATGAACACATGCTACA 

GAATGGATAGCCTCACTTTACTATGAAGTGAAGGCCAGAAACGAAGTCCATATATTGCATCATACAAAATA 

TCCAGAAGAGGGAAGbcCACAGAGACAGAATGTGCAATGGTGGATGCCAGGGTCTGGGGAGAGGGGAGAGT 

GGGGAGAAACTGCTCAACTGGTACAGGCTTTATTTTGGAATGATGGGAACATTTTGCAACTAGATAGAGGT 

AGTGATTGCAGAACACAGAATGTACTGAATTCCACTGATTTTTTTCACCTTAAAATGGTT^ATTTTCAGTC 

CTGAGATTGGATAATCATAAAAAAATGGTTAATTTTATGTTATGTGAATTTCATCCCTATACATATTTTAA 

ACCTCAGAAATATACACTAGCAGGCATGGAACAGGTCACTGTGGTGCCTGCCAAGCCCGGTGATGTTATCT. 

GGGGTCCCCGGCCAGCCTTAAGCCTCTTGCTGACCGGTGGAGGGCAGAACCTTTGCCCTAAhAGTATAATA 

TCCACATGCTGGCATGATTCCTGGCCAGATGGCTTCTTTATTAGCAGTAATTGAAACTGCCTCGATACAGA 

CACTGTACCTTGCAACCAAAAAATGACTCAACAATGATAATAAGGGTTAAGCTGGGCCTTTCTCTCTTTGC 

CAGTTAAATTATATTTATTATAGCTTGACATGAAAAACAAAGCAACTCCAACAGGTATCACAAGGGCAAAG

GACATGAACATTTTATCAAAGAAGAAATGCAGCTGTCAAAAATACAGAAA 

AAGTGGCTGGGCTCAGTGGTTCATGCCTGTAATCCCAGTGCTTTGCAAGGCTGAGACAGGAGGATCATTTG 
AAGCCAGAAGTTCAAGACCATCCTAGGCAAGTCAGTTCAATACCAGACTTCATGTCTACAAAACATCAAAA 
AATTAGCCAGGCATGGTGAyGGATGCCTGTTGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTG 
AGCCTGGGAGGCTGCGGTGGCGGTGAGCCATGATTGTGCCATTGTACTCCAGCCTGGGCAATGCAGCAAGA 
CTGTCTAAATAAGAAAAATAATAGTAAAGAAAAGGATTGGGATGCCATTTACT^ 

AGTTAAAAGTAATTTCTACGTTTTCTATTTTTTTATTACTAAAAAAAGCTGGACCATTCTCACAGCCTGAA 
ATGCTTCTCACTTTCCCTTCTTCTGTCCAAACACTTCTCTATGATAATGCAAACAGTCACTCCTTTAGGAA 
GACTTCACCCCAGGTAGTTCCAGATCCCCTTATCTCTGCCTTCCCAGAACTCCTGGTGTCXCTCCAGTTCC 
CTCCGTGTGGTGAAGTACCCTACCTAGGGTTTCAGTATGGCTCTGTCTGCAAAGGTCTTGTTCACACCTTC 
CCTTATGGTTCTGTTGCCCTGTGTTGTGTCATAGCACAGGGCACAGTGGAGAACCCATTCACACTGATAGA 
GAGGGCCCCATGGTCCTGGAGATAACCATGTAACCGATCAGAATAAGGCATTGAGGGCTGGGTGTCAGGCG 
TGGGCTGCACTTGGGTGGGCAGGTCCCCTGGAAAGTCACTGGGTTTGGCAAGCTTCCTAGT.-ACATGTCTC 
TCTGGGGTCCCCCTTGGAACTTCATGCAAAAATGCTGGTTGCTGGTTTATTCTAGAGAGATGGTTCATTCC 
TTTCATTTGATTATCAAAGAAACTCATGTCCCAATTAAAGGTCATAAAGCCCAGTTTGTA-^.CTGAGATGA 
TCTCAGCTGAATGAACTTGCTGACCCTCTGCTTTCCTCCAGCCTCTCGGTGCCCTTGAAATCATGTCGGTT 
CAAGCAGCCTCATGAGGCATTACAAAGTTTAATTATTTCAGTGATTATTAAACCTTGTCC7GTGTTGACCC 
CAGGTGAATCACAAGCTGAACTTCTGACAAGAACAAGCTATCATATTCTTTTCAATTACAGAAAAAAGTAA

AAACACTAAGAAGGTATAAGAGCATAATGCTTTTATACTACTAAGAATAATGTTTTCTCT.--^ 
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GGTAGATGCTTT'CATCAGATTAAGAAAATTCCCTGCTATTAGTTGTTGAAGGTTTTTATATCATAAATGAA 
AGTTGAATATTATTATCATATATTATTAATATATTGTTATTGAACTATCAAAGCCTTTTCCTAAAACCATT 
GAGATGATCTTATAACCATTCTCCTTTAACCTGTTGACGAGATCATTGGTATTTATACTATTTCTCTGTTA 
ACCATTCTTGAGTCTCAGGTTTAAATTCAACTTGGTCATGGTGTGTCATCTTTGATCATTGCTGTCTGTGG 
CTTGCTACTGTTTTGTTTAGGATTTTTGCACTGATGCTCATCAATGAGACTGGCATGCCATCTTCCTTTGC
AGTCCTGATTTTTTTCTGATTTGGATCATGTGGTTATGGCCCTCATGGAATGAGTTGGGCATGATGCCTTT 
TTTTCATGTCTCTGGATTGATGGGACACTTTGGATTCTCTCCAGATGGCCCTCAATGGTCCCTGCCTCCTC 
ATTGTTAGGCCCCTGGGCAAGCCCTTCTCATTTCTGGTAGGCCCAGGAACCTGTGGGGGTTTTGTTTGTTT 
GTTTGTTTCTTGAGTCGGAGTCTCACTCTGTCACCCAGGCTGGAGTTGGAGTGCAATGGCCCGATCTTGGC 

TCACTGCAACCTCCACCTCCCAGATTCAAGCAATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGAATTACAG
GCACCCACCGACACACCCTGCTAATTTTTGTATTTTTAGTACAGATGGGGTTTCACAATATTGGCCAAGCT 
GGTCTCGAACTCCTGATCTCATGATCTGCCCGGCTTGGCCTCCCAAAGTGTTGAGATTACAAGCATGAGCC 
ACCACACCCAGTGAACCTGTGGTTTTTAGAAGCTCCCCATGCATGTGAATGCTGTGAGCATCCCAGGATGA 
CAGCCACTGTGTGTTCAGCTGTTGGAACTGTGAGAAAGCACCAGTGGGACCTTCTCCAGCACCTGCCTGCT 

GAGTTCATGGAAGAGGCTTGTTGGGGAGATGATGCCCTGGCTGACTCCTGAAGGATGGTTAGGAATGCACC
AGATGGAAGCTGGGTTGGACCCACTCTATGCTGAAGAACAGCTTGTGTGGACACAAGGAGACACGGATATG 
TCATTTTTGTAGAGCCTGAGGAGTGTCCAATCACACCATTTGCTTAAAACATCATGCACACTTGGAAAAGT 
GGACTGAGACCGAATGAAGAAGCTAACAGTGGCCAGATCAGAAAGGGTCTTGTGTTACTTCCTAGAGATAC 
TTAGATTTTATCCTGTGGGTGATAGGAGCAGTTGGAGdGACTGAAGACAAGGAAAGAAACATGTTTCAAGA 

TCTATGTTTTTCAAGACGCTTTTCTGGTGGCTGAGTAGGGAATTCCCTGGATAAGTCCTGCCCAGGGTCAG
GCAAAACAAGTTAGGGGGTTACTGAAATAAGGAGTATGAGAAATGGTGTAGGTTGTGCTGACGTTTTGTAA
CACATCTCATGATGATCTTCATTTCCTTCACTAATTTCCTGTTTCATTAATTCCCTTCCACGTGCTCTTCT 
GAAATTTGCCTCAC^TTCTCTGATTTCTCTTTTACCTGTTGGTTTCATCACCTTTTACTTTTTGCTTTCCT 
GGAAACACAAATGATTCTCATTGTGACATGTCAGAATTATTTGCAACATTTGCCTTTCTGCTGAAACCATG 

AGTTCACTGAATACACAATTTAGTAAAGTGTAGGATGCACATGTCGTTTTCGTGGTCACAACCAGCTCTGT
AGCATTTTATAACTACACTGGCAGTGTGCTGGGAGGTGTAGAGAGAAATATTTATCACATGTGTGGCTGAC 
ACAACCTGCCAAGTTATTTTAGGAGCCTCCTTGGAATCCCAGCAAGAATGCTACCGGCACAAT.XTGTAATC 
ACAGCATCCTGCTCCATGCCTTGGCTTCATGGCATAGTCACTTCTGCAAGTCTCTTTCCAGCTGTCTGTTC 
CCATGTCTATAAAGTATGAGTTAAATCATCCTAACACTACTCATCTTACAAAGTTTTCTTGCTGATGTTAA 

GAGAGTTGGGAAAGAACTGTATAAACTGTGAAGTGCCATGGAGATGTTAGTGGTTACTTTATCAAGAAATA
GACACTCTAGAATGGAGTAGAAAGCCAACAGTTATGATTGAGTCCTCCTCCTCTTCTTCTTTTTATTAATT 
TATAAAGAAAAGAGGTTTAATTGACTCACAGTTCCATATGGCTGGGGAGGCCTCGGGAAACTCTCAGTCAT 
AGCAGGAGGCAAa.GGGGAAGAAGGCACCTTCTTCACAAGGCGGCAGGAGAGAGAGAGCTCCTGTTCTTTTT 

tgtcataaagtctacagaagtgcttatacttcaggacaagggcaggcagagagaaggaaggacattgcttc 

accccagccctcactgacgagtttgctaggggacctcactttgtcccagagtagggcag;-_-.ctctggccac

TACCCATTCAGAAGGCCTGGGCTGCACTGCTAGTTCCTCACTAACTCTGTGTGGCCTTGGGCAAGGTTGGG
CCTGTGTTAACAGATTATGACCCTGGGCTCTCAAGCTAGAGGATCTAAATTTGAATCCTGGCTCTGCTAAA 
GCAATTAGTGATGTAAACTTTAATGGGTCAGTTAACCTTCCTGTGGCTTAGTTTGCTCATCTGTAAAATAG 
GGATCATAACAGTATCAATACCACATGATTGTTGGACAGATTGAATCAGTTAATGCAGGGGAAGTACTTAG 

CATGACACGTATTCACTATCATTTCCTGGAGTAAGAGCTGTGTGTGAGTGGGTGTGAGCATGTGTGAAACC
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TTTTCTCTGCAATCTCAGTTAAGAAACCAATCCAGAATTTAAAGTTCAGGGCCTAAATGGGTGGTTATCTT 

CTCCCAGTTCCATCCTATCCCACCTTTGCTCTTCCTCCCGCCCACAGGAGCTGTTGGTCCTTGATTGGGCT 

GGAAGACCTGGTGGACCCTAAGTGATCTATAAGAGGAGAATAGAGAACAGGGAATGTCTTCAAAAATCTAG 

AGGGACACAGAGGCTGAGAGGCAGGCAGTCCTGCAGGGTCTTCTGATTGGGACAAGGAGA^CCTTGGTCTT 

CACAGGCCAATTCTGGTCAGTTTCCCCCATGGACAGATGAGGAAA.CkGGCCCAGGAATATCCAAGGTCTCA 

CACTTCCCATCTGTCAAGTCTTGTTGATTCTGTTGTATTCATGTCTCTCA^GGGAGATAGAGTTTAGGGA 

AGAAAGAAGGATCAACTGTGTCTGATACCACTGGGAGCTTAAGTAAAGGGTTCTTTTACTTCATAGCATTT 

ATCCCAATTTGTAATTCAGTATTATTTGTGTGGCTGTTTGGTGTCTCTTTCTCCTATATGAGTGCTAGCTT 

CATAAGGGCAAGGATTTTGATTCTTTAATATTTAGTGCTTGCCACATGCCCTGAACACAGCAGGCATACAG* 

GCTAACCAACATACAGTGGCATGAAAGTCATGAAAGTGAGACACCTACCTCCTCCAGTGCCAAGAGAGCAT 

AACCATGCACCTGTCACTCTCCTCAACACCACCCCCAAGCATGAGGCCCAAAAGCATTAGCTAATCCCCTC 

CTCCAGCCACTAAAACTTAAAGGCCAGGTGTGGTGGCTCCCATCTGAAATCCCAGAACTTCAGGAGACAGC 

AGCAGGAGGATCACTTGAGGCCAGGAGTTTGAGATCAGCCTGGGC71ACATAGCTAGGTCCCATCTGTACTA 

AAAATTAGCTGGGCGTTGTTGCATGCCTGTAGTCCCAGCTACTAAGGAGGCTGAGGTGGGAGGATCACTTG 

AGCCCAGGAGGTGGAAACAACAGTAAGCTATAATCACAGCACTGAACTCTAGCCTGGGCAACAGAGTGACA 

CCCTGCCTCAAAACAATTTTAAAAATAAATAAGAGCAAAACTTAGATACCACGTGGTCACCCCAACATGCA 

AAATCAAGTTTTCCCCTACTGAGAAGAATGGGGACTTGACAGCTGAGTTACAGAGAGATAATCTTCTTCTT 

CTTTTTTTTTTTTTGGTTTACATCCTCAAGATCATGACTTGTGAAATTTGAATCGAATACACATGTAATTC 

CAGAGCAATGTTGCCTCCGCATACCATCAGCAATTCACTTGGCTACTGGAAGTCAGGATAAGCTTCCCAGA 

AGAGAGGTACCACTTGGGCTACCAATATAAAAGGATGAAAATATCAGAGTGATGGTGTTCTTTACAACGTT 

GAGTCCCTGGACAGCCTGTCCACTGATGCTGATATCTGAGCCTAATGCTTCTCTGAATGTTGAGATTGAAC 

TTTGATCCAATGAAACTAGAACGAGAAAGAAGATAAGTCTTTCATTGTTGATAAGGACATTATGTTTCTCA 

TACTTGTATGATTATTTTTCCTTAGCTGTACTATAATTATCTGCTTATTTGTCTCTGCTCTATGTGCTTAG 

GGTACAAAGTTGACCAAGACCAACTTTGGTTGGAAGCATAGTACTAAGAGCACAGTACTGAGAGCACAGTA 

TTGAGAGCACAGCTTTAJ^AAAACATGATGAAGGCTTTAATACAGGAAATGAGCAGGGGAGAGGCATGTGGT 

GGTTGGATGTATCTTCCTTGACACAGTCAGTGCAGCTCTCAGTAGTCAAGTCCCTACATGTTAGAAGATGT 

TACCTTCTGTGGAATTAAGTGGCAGAACTTGCCTTCAATTATTTTCCTTTGCAGAACAACACCAACTGCAT 

TAGTTAGGACACAGTGCTGGCTGCATTTAAGTCCCAAGCGATGATTAGTCTCTCACTGTTGGTATAGATTC 

AAACCAATCAGACCACCTCCTAAAGTTTGTAGGGCAGGTAAATCCTCATCTTAGAATAAAAATCATCTTAC 

CAAGTATGTGTTTTAGAGGCAAGAAGAAAACATATTTGTTTCTGTAAGAGTTTTGTTTAAAAAAAATATAA 

GAAAGGCTCTCGGTTTAGGTGAGGTAATGAAGTTGTTGATAGTTATCAGATGACACTGGAATCTTTACTTC 

TCTGA^CGTGTTCTGTGCATCTCTCAGTGTGGGAACATAGAGAGGGAGATCCTCC/^GCi-ATGCCACTGATA 

TGGTCAGAAACTGCATCTTTCTTTCTCCCTGCTGAGATGAGATGGAGTCCTTTGTTCTAG.AAGACCCATGG 

TGGTGCCGCTGGGAGTAACCCTTGAGACAGGAACACAAATCCCAACCAATTTGTGGTTGCAGCCTTGAGTC 

TCACTATTTCCCATAGTGATGCGTAGCAGGGAATGGCAGGTGCACCAGAGCAGGAGAGGACCTAATATCTC 

CCTTCCTGTTAGCTTTTTATAAAGTTTTATTGTGATCAGTAGCAGTTGGGAAGCTACTTGCAGTCACTGAG 

CCTCAGTTTCTACATCTGTAAACTGGGGATAGTAGCATGGCCCCTACTTAATGTGCTCAGCAAAGCCACtG 

AAAGGAGACAGAAATGTATCTAAATTACCCTGGACTTTTATCCTACCTCTCTTGGGGATTGTCACCACCTT

CCCATGTTTGTCCTTTTTGGTTTGATGCTTGCTGTCACTTCTT^ 

Tl^ATCCCAGGGATTCCAGAGl-TACAGCACATGCATACCACCATCCAAGCATGTTTATTTGTCTCCTGCTTC 
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ACTAGGCTGTCCCCAAGfeAACATGTGGCTCCCGGCACACACCTGGCACAACACTGCACATGAC^TTC^CC^ 
ACTTGGCCTTGAATCTGACAAGGAATCTGGCATGATGTTCACCCACTCAGGCCAGGTGCCGAGCAGCCCTG 
GAGGCTTAGGGGCCAGAGGGATGGGAAAAGGTGTCTTTCTGGGGTGAGTATCAGTTTCTGCAGGAGGGCTG 
AATGTGAGAAAGAATAAAGAGAGAAGGAAGCGAACAAGCACAGCTTAAACATCGCCTATTTCTATTGAGTT 
TTAAGAACGCTGTGATTTTGTTTGTCATGCAATCCATTCATCAGGCCAGGCAGACACAGAACTTGGGTGTG 
AGTGACGATAATGAGCTGATATAATTTTCACACCCTCATCACTGAGATCTCTCCCATCAGGAATGGGTCAG 
GGAGCTCACAGGTGGCAGCAACTGCTATTACAGGCCTCATCTCTACCAGCTCCTGGGGCCTGCCCTCCTCC 
CATTAGAAAATCCTCCACTTGTCAAAAAGGAAGCCATTTGCTTTGAACTCCAATTCCACCCCCAAGAGGCT 
GGGACCATCTTACTGGAGTCCTTGATGCTGTGTGACCTGCAGTGACCACTGCCCCATCATTGCTGGCTGAG 
GTGGTTGGGGTCCATCTGGCTATCTGGGCAGCTGTTCTCTTCTCTCCTTTCTCTCCTGTTTCCAGACATGC 
AGTATTTCCAGAGAGAAGGGGCCACTCTTTGGCAAAGAACCTGTCTAACTTGCTATCTATGGCAGGACCTT 
TGAAGGGTTCACAGGAAGCAGCACAAATTGATACTATTCCACCAAGCCATCAGCTCCATCTCATCCATGCC 
CTGTCTCTCCTTTAGGGGTCCCCTTGCCAACAGAATCACAGAGGACCAGCCTGAAAGTGCAGAGACAGCAG 
CTGAGGCACAGCCAAGAGCTCTGGCTGTATTAATGACCTAAGAAGTCACCAGAAAGTCAGAAGGGATGACA 
TGCAGAGGCCCAGCAATCTCAGCTAAGTCAACTCCACCAGCCTTTCTAGTTGCCCACTGTGTGTACAGCAC 
CCTGGTAGGGACCAGAGCCATGACAGGGAATAAGACTAGACTATGCCCTTGAGGAGCTCACCTCTGTTCAG 
GGAAACAGGCGTGGAAACACAATGGTGGTAAAGAGGAAAGAGGACAATAGGATTGCATGAAGGGGATGGAA 
GGTGCCCAGGGGAGGAAATGGTTACATCTGTGTGAGGAGTTTGGTGAGGAAAGACTCTAAGAGAAGGCTCT 
GTCTGTCTGGGTTTGGAAGGATGTGTAGGAGTCTTCTAGGGGGCACAGGCACACTCCAGGCATAGGTAAAG 
ATCTGTAGGTGTGGCTTGTTGGGATGAATTTCAAGTATTTTGGAATGAGGACAGCCATAGAGACAAGGGCA 
AGAGAGAGGCGATTTAATAGATTTTATGCCAATGGCTCCACTTGAGTTTCTGATAAGAAeCCAGAACCCTT 
GGACTCCCCAGTAACATTGATTGAGTTGTTTATGATACCTCATAGAATATGAACTCAAAGGAGGTCAGTGA 
GTGGTGTGTGTGTGATTCTTTGGCAACTTCC^GGTGGAGAAGCCTCTTCCAACTGCAGGCAGAGCACAGG 
TGGCCCTGCTACTGGCTGCAGCTCCAGCCCTGCCTCCTTCTCTAGCATATAAACAATCCAACAGCCTCACT 
GAATCACTGCTGTGCAGGGCAGGAAAGCTCCATGCAC7,XAGCCCAGC^GAGCAACACA3
Figure 6.

CTGGTTCATCTCATTGGGAC'TGGTTGGACAAGAGGGTGCAGCCCACGGAGGGTGAGCCAAAGCAGGGTGGG 
GCGTCGCCTCACCTGGGAi.GCACAAGGGGTCGTGGAATTTTCTCCCCTACCCAAGGAAAGCCATAAGGGAC 
TGAGCCTGAGGAACTGTGCACTCTGGCCCAGATACTGCACTTTTCCCATGGTCTTTGCAACCCGCAGACCA 
GGAGATTCCCTCCGGTGCCTATGCCACCAGGGCCCTGGGTTTCAAGCACAAAACTGGGCAGCCATTTGGGC
AGACACCGAACTAGCTGCAGGAGTTTTTTTTTTTTTTTTCCATACCCCATTGGCACCTGGAACGCCAGTGA 
GACAGAACCGTTCACTCCCCTGGAAAGGGGGCTGAAACCAGGGATCCAAGTGGTCTGGCTCGGTGGGCCCC 
ACCCCCATGGAGCCCAGCAAACAAAGATTCACTTGGCTTGAAATTCTTGCTGCCAGCACAGCAGCAGTCTG 
AGATTGACCTGGGACCCTCGAACTTGGTTGGGTGCTGTGGGGGGGCATCTTCCATTGCTGAGGCTTGAGTA 

GGTGGTTTTACCTTCGCGGTGTAAACAAAGCTGCTGGGAAGTTTGAACTGGGTGGAGCTCACCACAGCTCA
GTAAGGCCACTGTGGCCAGACTGCCTCTCTGGATTTCTCCTCTCTGGGAAGGATATCTCTGAAAAAAAGGC 
AGCAGCCCCAGTCAGGGACTTATAGATGAAACCCCCATCTCCCTGGGACAGAGCCCCTCGGGGAAGAGGTG 
GCTTCCACCATTGTGGAAGACTGTGTGGCAATTCCTCACGGATTTAGAACTAGAGATACCATTTGACCCAG 
CAATCCCATTACTGGGTGTATACCCATAGGATTATAAATCATTCTACTATAAAGACACATGCACACTTATG 

TTTATTGTAACACTATTTACAATAGCAATGACCTGGAACCAATCCAAAAGCCCATCAATGATAGACTGAAT
AAAGAAAATGTGGCACATATACACTGTGGAATACTATGCAGCCATAAAAAAGGATGAGTTCATGTCCTTTG 
CAGAGACATGGATGAAGCTGGAAACC^TCATTCTCAGCAAACTAGCACAATAACAGAA^ACCAAACACTGC 
ATGTTGTCACTCATAAGTGGGAGTTAAACAATGAGAACACATGGACACAGGGAGGGGAACGTCACACACTG 
GGGCATGTCGGGGAGTGGGGGCCTACGGGAGGGATAGCATTAGCAGAAATACCTAATGTAGGTGACGGGTT 

GATGGGTGCAGCAAACCACCATGGCACATATACACCTATGTAATAAAACTGCACGTTCTGCACATGTACCC

CAGAACTTAAAGTATAATTAATAATAATAATAATTTCTGGGCATGTAAGTAGCTGTCTTXCAGGTTCTACT 
TTGATACATATTCTGAGAGAATTAAACCTGTCAAAGAAACCTTGACTTTCAATGGCAGGCACTGGAATTGA 
CCCTAATAATGTGTTTTGGGGTAAGCCTACTCATATTCTCAACCTGTCTGCAGTAGTCGTTAGAATCTGAA 
CTTCCTGAAGTTCATGTGCAAAGTTGAGTTAATTGTTTAATATTCAACAAGGATTATGCCAGTAAGATGGT 
AGGAAAATATTAGATATGTGTCATCACTGCTGGTATTATTTAAACTGCAACATATTTTAGCTGGCTGCTGA
TCTCAGCCACCATGCCTGCATTTTATCTCTGTCTCGTGGTCTGCAA.CCTTGGAAGCTTTGAACTTAGCTCA 
TAGAATCCTGGGCATCAAGA^CATGTGGTTCTAATGGCTAGATAGGGAATGAGAGTA^J^.GGATTTTGCCC 
ACGGTCACGTGAGTAAACAACAGATTTGGAGGGGTCTGGACTACTGTGATGACTTCATTCTGACAATATGT 
TCCAGTTGTCCTTTCATTTCCTCCTAATCACATGTCTGGTCTGATCTGGCTGTTTCCChCCTTCCAATTCC 

TGCCTTCTCCAATGCTCCCTTCCGTAGGTCACTCTGTGGCTCAGAGACCCTGCTTAGCAAGCGCCCAACCT

TTCAATTATTTGTTCAGTAA.^CTTGAACTCATGTCTCCCCTTCTTGATAAAAAGA"_-ATACGTTATGT^ 
TGTCGGGTTACTCT/iTAACTCTTGTCCTGTCTCTCGGCAACTACTGAACTAACTGTTTTCATATTGAGCAA 

acgtttatggaa.ggactgccaa.gagtcaggtactaggcttggtaa.tattccccgttc'tctctagtcaaagc 
caacaccagccagac-ti^gcagatctaggtcccaagcccactgcagatcacaggccac-c-gtctggtctcctc 

TGAGCTCCTTTGGGAGGGAAAGACAGAATTATTAACACCCATTTTGTAGATTAGGCAACTGAGGCTGAGGA

agtttaaataactcagacagggcctgcacgtcagtcatattcca^ggatccctactcactgtcttctctct 
acaga^cgagatgtctctggagtccataga\agcccaggagcctggctgggcacggtggctcctgcctgta 
atcccagcactttgggaggccgaggcaggcagatcacctgagctcaggagttca^gaccagcctgggcaac 

ATGGCAAA^CCCCATCTCTACTAA^AAATACAAAAA^vTTAGCTGGGCGTGGTGGTGCATGCCTCTAATCCCA 

gctacttgggaggctgaggcacaagaattgcttgagcccaggaggcagcagttgcagtgagctgagattgt
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GCCAGTGCACTCCAGCCTGGGCAACAGAGCAAGATTCC^ 

ACAAAAATAGAAAGCCCAGGGACCACCTGCGTCAGGTTCCCAGCCACACCTTTTTCTTGTCCTCCTCTGTC 

TCTGGCATCTTCTCACAGGTTCCTAATTGTTTGTGGTTGCACAAATTCAAAATCCCAGAAAAATTACCACT 

TCACACCCACTCAGATGGCTATTTTTTTTTTGAAGGAAGATAACAAGTGTTGACAAGAACATGGAGAAATT 

GGAATTCTCACCCATTGCTGGTGAGAATGTAATACGGTGCTGCTGCTATGGAAAACAGCTTGGAGTTTCCT 

CAAAAAGTTCAACAGAATTTCAATGTGACCCAGCAATTCCCCTCTAAGTTATAGATCTGAGAGGATTAAAA 

ACAGTTACTAAAATACACGGACTCACATATTTCTAACAGTCCAATTCACAAGGGCCAAAAGGTGCTAATAG 

CCCACATGTCCATCGATGGATGGATAAATAAATTGTGGTCTATCCATACAATGGAATATTATTCGGCCATA 

AATGGAATGAAGTACTGACGCATGCTACAGAATGGATGAACCGCAAAAAAAATGGATGAACACATGCTACA- 

GAATGGATAGCCTCACTTTACTATGAAGTGAAGGCCAGAAACGAAGTCCATATATTGCATCATACAAAATA 

TCCAGAAGAGGGAAGCCCACAGAGACAGAATGTGCAATGGTGGATGCCAGGGTCTGGGGAGAGGGGAGAGT 

GGGGAGAAACTGCTCAACTGGTACAGGCTTTATTTTGGAATGATGGGAACATTTTGCAACTAGATAGAGGT 

AGTGATTGCAGAACACAGAATGTACTGAATTCCACTGATTTTTTTCACCTTAAAATGGTTAATTTTCAGTC 

CTGAGATTGGATAATCATAAAAAAATGGTTAATTTTATGTTATGTGAATTTCATCCCTATACATATTTTAA 

ACCTCAGAAATATACACTAGCAGGCATGGAACAGGTCACTGTGGTGCCTGCCAAGCCCGGTGATGTTATCT 

GGGGTCCCCGGCCAGCCTTAAGCCTCTTGCTGACCGGTGGAGGGCAGAACCTTTGCCCTAAAAGTATAATA 

TCCACATGCTGGCATGATTCCTGGCCAGATGGCTTCTTTATTAGCAGTAATTGAAACTGCCTCGATACAGA 

CACTGTACCTTGCAACCAAAAAATGACTCAACAATGATAATAAGGGTTAAGCTGGGCCTTTCTCTCTTTGC 

CAGTTAAATTATATTTATTATAGCTTGACATGAAAAACAAJVGCAA 

GACATGAACATTTTATCAAAGAAGAAATGCAGCTGTCAAAAATACAGAAATATTCAACCTTGTTCATAATA 
AAGTGGCTGGGCTCAGTGGTTCATGCCTGTAATCCCAGTGCTTTGCAAGGCTGAGACAGGAGGATCATTTG 
AAGCCAGAAGTTCAAGACCATCCTAGGCAAGTCAGTTCAATACCAGACTTCATGTCTACAAAACATCAAAA 
AATTAGCGAGGCATGGTGATGCATGCCTGTTGTCCCAGCTACTCAGGAJ3GCTGAGGCAGGAGAATTGCTTG 
AGCCTGGGAGGCTGCGGTGGCGGTGAGCCATGATTGTGCCATTGTACTCCAGCCTGGGCAATGCAGCAAGA 
CTGTCTAJ^TAACAAAAATAATAGTAAAGAAAAGGATTGGGATGCCATTTACTTGCG'TATTCAATACACAG 
AGTTAAAAGTAATTTCTACGTTTTCTATTTTTTTATTACTAAAAAAAGCTGGACCATTCTCACAGCCTGAA 
ATGCTTCTCACTTTCCCTTCTTCTGTCCAAACACTTCTCTATGATAATGGAAACAG.TCACTCCTTTAGGAA 
GACTTCACCCCAGGTAGTTCCAGATCCCCTTATCTCTGCCTTCCCAGAACTCCTGGTGTCTCTCCAGTTCC 
CTCCGTGTGGTGAAGTACCCTACCTAGGGTTTCAGTATGGCTCTGTCTGCAAAGGTCTTGTTCACACCTTC 
CCTTATGGTTCTGTTGCCCTGTGTTGTGTCATAGCACAGGGCACAGTGGAGAACCCATTCACACTGATAGA 
GAGGGCCCCATGGTCCTGGAGATAACCATGTAACCGATCAGAATAAGGCATTGAGGGCTGGGTGTCAGGCG 
TGGGCTGCACTTGGGTGGGCAGGTCCCCTGGAAAGTCACTGGGTTTGGCAAGCTTCCTAGTAACATGTCTC 
TCTGGGGTCCCCCTTGGAACTTCATGCAAAAATGCTGGTTGCTGGTTTATTCTAGAGAGATGGTTCATTCC 
TTTCATTTGATTATCAAAGAAA.CTCATGTCCCAATTAAAGGTCATAAAGCCCAGTTTGTAAACTGAGATGA 
TCTCAGCTGA.ATGAACTTGCTGACCCTCTGCTTTCCTCCAGCCTCTCGGTGCCCTTGAAATCATGTCGGTT 
CAA.GCAGCCTCATGAGGCATTACAAAGTTTAATTATTTCAGTGATTA.TTAAACCTTGTCCTGTGTTGACCC 
GAGGTGAATCACAAGCTGAACTTCTGACAAGAACAAGCTATCATATTC 

G TT A A TTG AT AG G ATTTT TTTTG T T T AA AAA A AAT G TT A C T AG T TTT G AAAAG G T AAT AT G TG CLACAT G GT 

AAACACTAAG.^AGGTATAAGAGCATAATGCTTTTATACTACTAAGAATAATGTTTTCTCTAA 

GGTAGATGCTCT'CATCAGATTAAGAAAATTCCCTGCTATTAGT^ 
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AGTTGAATATTATTATCATATATTATTAATATATTGTTATTGAACTATCAAAGCCTTTTCCTAAAACCATT 
GAGATGATCTTATAACCATTCTCCTTTAACCTGTTGACGAGATCATTGGTATTTATACTATTTCTCTGTTA 
ACCATTCTTGAGTCTCAGGTTTAAATTCAACTTGGTCATGGTGTGTCATCTTTGATCATTGCTGTCTGTGG 
CTTGCTACTGTTTTGTTTAGGATTTTTGCACTGATGCTCATCAATGAGACTGGCATGCCATCTTCCTTTGC 
AGTCCTGATTTTTTTCTGATTTGGATCATGTGGTTATGGCCCTCATGGAATGAGTTGGGCATGATGCCTTT
TTTTCATGTCTCTGGATTGATGGGACACTTTGGATTCTCTCCAGATGGCCCTCAATGGTCCCTGCCTCCTC 
ATTGTTAGGCCCCTGGGCAAGCCCTTCTCATTTCTGGTAGGCCCAGGAACCTGTGGGGGTTTTGTTTGTTT 
GTTTGTTTCTTGAGTCGGAGTCTCACTCTGTCACCCAGGCTGGAGTTGGAGTGCAATGGCCCGATCTTGGC 
TCACTGCAACCTCCACCTCCCAGATTCAAGCAATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGAATTACAG* 

GCACCCACCGACACACCCTGCTAATTTTTGTATTTTTAGTACAGATGGGGTTTCACAATATTGGCCAAGCT
GGTCTCGAACTCCTGATCTCATGATCTGCCCGGCTTGGCCTCCCAAAGTGTTGAGATTACAAGCATGAGCC 
ACCACACCCAGTGAACCTGTGGTTTTTAGAAGCTCCCCATGCATGTGAATGCTGTGAGCATCCCAGGATGA 
CAGCCACTGTGTGTTCAGCTGTTGGAACTGTGAGAAAGCACCAGTGGGACCTTCTCCAGCACCTGCCTGCT 
GAGTTCATGGAAGAGGCTTGTTGGGGAGATGATGCCCTGGCTGACTCCTGAAGGATGGTTAGGAATGCACC 

AGATGGAAGCTGGGTTGGACCCACTCTATGCTGAAGAACAGCTTGTGTGGACACAAGGAGACACGGATATG
TCATTTTTGTAGAGCCTGAGGAGTGTCCAATCACACCATTTGCTTAAAACATCATGCACACTTGGAAAAGT 
GGACTGAGACCGAATGAAGAAGCTAACAGTGGCCAGATCAGAAAGGGTCTTGTGTTACTTCCTAGAGATAC 
TTAGATTTTATCCTGTGGGTGATAGGAGCAGTTGGAGGGACTGAAGACAAGGAAAGAAACATGTTTCAAGA 
TCTATGTTTTTCAAGACGCTTTTCTGGTGGCTGAGTAGGGAATTCCCTGGATAAGTCCTGCCCAGGGTCAG 

GCAAAACAAGTTAGGGGGTTACTGAAATAAGGAGTATGAGAAATGGTGTAGGTTGTGCTGACGTTTTGTAA
CACATCTCATGATGATCTTCATTTCCTTCACTAATTTCCTGTTTCATTAATTCCCTTCCACGTGCTCTTCT 
GAAATTTGCCTCACATTCTCTGATTTCTCTTTTACCTGTTGGTTTCATCACCTTTTACTTTTTGCTTTCCT 
GGAAACACAAATGATTCTGATTGTGACATGTCAGAATTATTTGCAACATTTGCCTTTCTGCTGAAACCATG 
AGTTCACTGAATACACAATTTAGTAAAGTGTAGGATGCACATGTCGTTTTCGTGGTCACAACCAGCTCTGT 

AGCATTTTATAACTACACTGGCAGTGTGCTGGGAGGTGTAGAGAGAAATATTTATCACATGTGTGGCTGAC

ACAACCTGCCAAGTTATTTTAGGAGCCTCCTTGGAATCCCAGC^GAATGCTACCGGC^.CAATTTGTAATC 
ACAGCATCCTGCTCCATGCCTTGGCTTCATGGCATAGTCACTTCTGCAAGTCTCTTTCCAGCTGTCTGTTC 
CC^TGTCTATAAAGTATGAGTTAAATCATCCTAACACTACTCATCt'TACAAA.GTTTTCTTGCTGATGTTAA 
GAGAGTTGGGAAAGAACTGTATAAACTGTGAAGTGCCATGGAGATGTTAGTGGTTACTTTATCAAGAAATA 

GACACTCTAGAATGGAGTAGAAAGCCAACAGTTATGATTGAGTCCTCCTCCTCTTCTTCTTTTTATTAATT

TATAAAGAAAAGAGGTTTAATTGACTCACAGTTCCATATGGCTGGGGAGGCCTCGGG.-_^.CTCTCAGTCAT 
AGCAGGAGGCAAAGGGGAAGAAGGCACCTTCTTCACAAGGCGGCAGGAGAGAGAGAGCTCCTGTTCTTTTT 
TGTCATAAAGTCTAGAGAAGTGCTTATACTTCAGGACAAGGGCAGGCAGAGAGAAC-G.^GGACATTGCTTC 
ACCCCAGCCCTCACTGACGAGTTTGCTAGGGGACCTCACTTTGTCCCAGAGTAGGGCAGA^CTCTGGCCAC 

TACCCATTCAGAAGGCCTGGGCTGCACTGCTAGTTCCTCACTAACTCTGTGTGGCCTTGGGCAAGGTTGGG

CCTGTGTTAACAGATTATGACCCTGGGCTCTCAAGCTAGAGGATCTAAATTTGAATCCTGGCTCTGCTAAA 
GC/lATTAGTGATGTAAACTTT^TGGGTCAGTTA.ACCTTCCTGTGGCTTAGTTTGCTCATCTGTAAAAT^G 
GGATCATAACAGTATCA.ATACCACATGATTGTTGGACAGATTGAATCAGTTAATGCAGGGGAAGTACTTAG 
CATGACACGTATTCACTATCATTTCCTGGAGTAAGAGCTGTGTGTGAGTGGGTGTGAGCATGTGTGAAACC 

TTTTCTCTGCAATCTCAGTTAAGAAACCAATCCAGAATTTAAAGTTCAGGGCCTAAATGGGTGGTTATCTT
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CTCCCAGTTCCATCCTATCCCACCTTTGCTCTTCCTCCCGCCCACAGGAGCTGTTGGTCCTTGATTGGGCT 
GGAAGACCTGGTGG^CCCTAAGTGATCTATAAGAGGAGAATAGAGAACAGGGAATGTCTTCAAAAATCTAG 
AGGGACACAGAGGCTGAGAGGCAGGCAGTCCTGCAGGGTCTTCTGATTGGGACAAGGAGAACCTTGGTCTT 
CACAGGCCA^TTCTGGTCAGTTTCCCCCATGGACAGATGAGGAAACAGGCCCAGGAATATCCAAGGTCTCA 
CACTTCCCATCTGTCAAGTCTTGTTGATTCTGTTGTATTCATGTCTCTCAAAGGGAGATAGAGTTTAGGGA 
AGAAAGAAGGATCAACTGTGTCTGATACCACTGGGAGCTTAAGTAAAGGGTTCTTTTACTTCATAGCATTT 
ATCCCAATTTGTAATTCAGTATTATTTGTGTGGCTGTTTGGTGTCTCTTTCTCCTATATGAGTGCTAGCTT 
CATAAGGGCAAGGATTTTGATTCTTTAATATTTAGTGCTTGCCACATGCCCTGAACACAGCAGGCATACAG 
GCTAACCAACATACAGTGGCATGAAAGTCATGAAAGTGAGACACCTACCTCCTCCAGTGCCAAGAGAGCAT 
AACCATGCACCTGTCACTCTCCTCAACACCACCCCCAAGCATGAGGCCCAAAAGCATTAGCTAATCCCCTC 
CTCCAGCCACTAAAACTTAAAGGCCAGGTGTGGTGGCTCCCATCTGAAATCCCAGAACTTCAGGAGACAGC 
AGCAGGAGGATCACTTGAGGCCAGGAGTTTGAGATCAGCCTGGGCAACATAGCTAGGTCCCATCTGTACTA 
AAAATTAGCTGGGCGTTGTTGCATGCCTGTAGTCCCAGCTACTAAGGAGGCTGAGGTGGGAGGATCACTTG 
AGCCCAGGAGGTGGAAACAACAGTAAGCTATAATCACAGCACTGAACTCTAGCCTGGGCAACAGAGTGACA 
CCCTGCCTCAAAACAATTTTAAAAATAAATAAGAGCAAAACTTAGATACCACGTGGTCACCCCAACATGCA 
AAATCAAGTTTTCCCCTACTGAGAAGAATGGGGACTTGACAGCTGAGTTACAGAGAGATAATCTTCTTCTT 
CTTTTTTTTTXTTTGGTTTACATCCTCAAGATCATGACTTGTGAAATTTGAATCGAATACACATGTAATTC 
CAGAGCAATGTTGCCTCCGCATACCATCAGCAATTCACTTGGCTACTGGAAGTCAGGAT 
Figure 7.

TCTAGAGAGA TGGTTCATTC CTTTCATTTG ATTATCAAAG AAACTCATGT CCCAATTAAA
GGTCATAAAG CCCAGTTTGT AAACTGAGAT GATCTCAGCT GAATGAACTT GCTGACCCTC
TGCTTTCCTC CAGCCTCTCG GTGCCCTTGA AATCATGTCG GTTCAAGCAG CCTCATGAGG
CATTACAAAG TTTAATTATT TCAGTGATTA TTAAACCTTG TCCTGTGTTG ACCCCAGGTG
AATCACAAGC TGAACTTCTG ACAAGAACAA GCTATCATAT TCTTTTCAAT TACAGAAAAA
AGTAAGTTAA TTGATAGGAT TTTTTTTGTT TAAAAAAAAT GTTACTAGTT TTTGAAAAGG
TAATATGTTG CACATGGTAA ACACTAAGAA GGTATAAGAG CATAATGCTT TTATACTACT
AAGAATAATG TTTTCTCTAA GTTTTTTTTG GTAGATGCTT TCATCAGATT AAGAAAATTC
CCTGCTATTA GTTGTTGAAG GTTTTTATAT CATAAATGAA AGTTGAATAT TATTATCATA
TATTATTAAT ATATTGTTAT TGAACTATCA AAGCCTTTTC CTAAAACCAT TGAGATGATC
TTATAACCAT TCTCCTTTAA CCTGTTGACG AG
Figure 8.

GGATCCAGTTTCAGCTTTCTACATATGGCTAGCCAGTTTTCCCAGCACCATTTATTAAR.TAGGGAATCCTT 

TCCCCATTGCTTGTTTTTGTCAGGTTTGTCAAAGATCAGATGGTTGTAGATGTGTGGTGTTTGTTCTGAGG 

CCTCTGTTCTGTTCCATTGGTCCATATCCCTGTTTTGGTACTAGTACCATGCTCTTTTGGTWCTGTAGCC 

TTGTAGTATAGTTTGAAGTCAGGTAGCGTGATTCCTCCAGCTTTGCTCTTTTTGCTTAGGATTGTCTTGGG 

AA.TGTGGGCTCTTTTTTGGTTCCATATGAAATTTAAAGTAGTTTTTTTTCCAATTCTATGAAGAAAGTCAT 

TGGTAACTTGATGGGGATGGCATTGAATCTATAAATTACCTTGGGAAGTATGGCCATTTTCACGATATTGA 

TTCTTCCTATCCATGAGCATGGAACATTCTTCCATTTGTTTGTGTCCTCTTTGATTTTGTTGAGCAGTGGT 

TTGTAGTTCTCCTTGAAGAAGTCCTTCACCTCCCTTTAATTTGGATTACTAGATATTTTATTCTCTTAGTA 

ACAATTGCAAATGGGAGTTCACTCATGATTTGGCTCTCTTTCTGTTATTGGTGTATAGG^TGCTTGTGAT 

TTTTGCGCATTAATTTTGTATCCTGAGACTTTGCTGAAGTTGCTTATCAGCTTAAAAGGATTTTGGGCTGA 

GACGATGGGGTTTTCTAAATATACAATCATGGCATCTGCAAACAGGAACAATTTGACTTCCTCTTTTCCTA 

ATTGAATACCCTTTATTTCTTTTTCTTGCCTGATTGCCCTGGCCAGAACTTCCAATACTATGTTGAATAAG 

AGTCATGAGTGAGGGCATCGTTGTCTTGTGCTGGTTTCAAAGTTTTTGCCCATTCAGTATGATTTTGGCTG 

TGGTTTTGCCATAAATAGCTCTTATTATTTTGAGATACGTTCCACCAATACCTACTTTATTGAGAGTTTTT 
AGCAGGAAGGGCTGTTGAATTTT [remainder of line illegible]

[start of line illegible] AATTTGCATATGTTGAACCAGCCTTGCATCCCAGG

GATGAAGCCCACTTGATTGTAGTGGATAAGCTTTTTGATGTGCTGCTGGATTCAGTTTGCCAGTATTTTAT 

TGAGGATTTTGGCATCAATGTTCATCAGGGATATTGGTCTAAAATTCTCTTTTTTTGTTGTGTCTCTGCCA 

GGCTTTGGTATCAGGATGATGCAGGCCTCAGAAACTGAGTTAGGGAGGATTCCCTCATTTTCTATTGATTG- 

GAATAGTTTCAGAAAGAATGGTACCAGCTACTCTTTGTACCTCTGGTAGAATTCAGCTGTGAATCCATCTG 

GTCCTGGACTTTTTGGTTGGTAGGCTATTAATTATTGCCTCAATTTTAGGGCCTGTTATTGGTCTATTCAG 

ACATTCAACTTCTTCCCGGTTTGGTCTTGGGAGGGTTTATGTGTCCAGGAATTTATCCATTTCTTCTAGAT 

TTTCTAGTTTATTTGTGTAGAGGTGTTTATAGTATTGTCTGATGGTAGTTTGTATTTCTGTGAGATCGGTG 

GTGATATCCCCTTTATCATTTTTTATTGCATCTATTTAATTCTTCTCTCTTTTCTTCTTTATTATTCTGGC 

TGGCGGTCTGTCAATTTTTTTGATCTTTTCAAAAAACCAGCTCCTGGGTTTCACTGATTATTTGAAGGGTT 

TTTTGTGTCTCTATTTCTTTC^GTTCTCCTGTGATCTTAGTrATTTCTTGCCTTCTGCTAGCTTTTGAATG 

TGTTTGCTCTTCCTTCTCTAGTTCTTTGAATTGTGATGTTACAGTGTTGATTTTAGATCTTTCCTGCTTTC 

TCTTGTGGTCATTTAGTGCTATAAATTTCCCTCTACACATTGGTTTACATGTGTCTCAGAGATTCTGGTAT 

GTTGTGTCTTTGTTCTCATTCATTTCAAGAACATCTTTACTTCTGCCTTCATTTTGTTATTTGCCCAGTAG 
TCATTCAGGAGCAGG 




[start of line illegible] TGAGCATGC

TTTATGTCCCATTATGTGGTCAATTTTAGAATAAGTGTGATGTGATGCTGAGAAGAATGTATATTCTGTTG 
ATTTGGGGTGTGGAGTTCTGTAGATGTCTATTCAGTCCACTGGGTGCAGAGCTGAGTGGACATGAACATTT 
TATCAAAGAAGAAACAqAGCTATCAAAAATCCAGAAAT^^ 

CTGGTTCATTCCTGTAATCTCAGTCCTTTGAAAGGCTGAGAAAGGAGGATCACTTGAGGCCACAAGTTCAA 
GACCATCCTAGACMGTCAGTTCAAGACCAGACTTCATGTCTACAAAACATCAAAAAATTAGCCAGGCATG 
GTGATGCATGCCTGTCATCCCAGCTACTCAGGAGGCTGAGGCAGGAGGATTGCTTGAGCCTGGGAGATTGA 
AGTGGCAGTGAGCCATGATTGTGCCATTGCACTCaAGCCTGGGCAATGCATCAAGAC'TCTGTCTAAaCAAT 
AATAATAATAATAGTAATAGTAATAATAATAATAATAAAGAAAACGGTTGGG ACG CC£ ^TCCTTACTTATT 
CAATACAOVAAGTTAAAAGCAATTTCTACTTTCTCTATTTTTTTATTA^ 

ACAGCCTG^AhTGCTTCTCACCTTCCCCTCTTCTATACAAACACTTCTCTGTTGATGAT^TGCAGACAGT 
CTCTCCTTTAGGAATACTTCACACCAGGTAGTTCCAGATCCCCTTATCTCTGCCTTCCCAGAGCTCCTGGT 
GTCTCCCCAGTTCCCTCTGTGTGGTG.AAGTACCCCCACCTTGGGTCTCAGCATGACTCGTTCTTTGAAGGT 
CTTGTTCACATTTTCCCTTATGGTTCTGTTCCCCTGTGTTGTGTCACAGCACTGGGCAGAGTGGACAACCC 
ATTCACACCGATAGAGAGGGCCCCATGGTTCTGGAGATAACCATGTAACTGATCAGAATAGGGCATTGAGG 
GCTGGGTGTCAGGCATGGGCTGCACTTGGGTGGGCAGGCCCCCTGGA/AAGTCACAGGATTTGGCAAGCTTC 
CTAGTAACATCTCTCCCTGGGGTCCTCTTGGAACTTCATGCCCGATGCTGGATGCTGGTTTATTCTCGAGA 
^ATGGTTCATTCCAA.TAATCAATGAAJICTCATGTCCCAACTAAAGTTCATAAACTCCAGTTTGTAAACTGA 
GATAATCTCAGCTGAATGAACTTGCTGACCCTCTGCTTTCCCCCAGCCTCTCAGTGCCCTTGAAATCATGT 
CAGTTCAAGCAGCCCCATGAGGCATTACAATGTTTAGTTATTTCAGTGTTTATTAAACCTTGCCCTATGCT 
GACCCCAGGTGAATCACAAGCTGGACTTCTGACAAGGACAAGCTATGATATTCTTTTC«J\TTACAGAAAAA 
^TAAGTTAAXTGATAGGATTTTTTAAAGATGTTACTAGTTTTGGAAAGGTAATTTGTGCACATGGTAAACA 
AGAAGGTATAAGAGGATAATGCTTTTATACTGCTGAGAATAATGTTTTCTCTAATTTTTTTTGGTAGA^?GC
TTTCATCAGATTAATAAAATTCACTGCTG1TAGTTC 

ATTATCATGTATTATTAATATATTATTATTGAACTAGCAAAGGCTCTTCCTAAAACAATTGAGATGATCTT
ATAATCGTTCTCCTTTAATCTGTTGATGAGATCATTC 

GTCTCAGGTTTAAATTCAACTTGGTCATGGTGTATCATCTTTGAAGACTCCTGTCTCT^ 
TGTGTTCAGCATTTTTGCACTGATGCCGATGAATGAGACTGGCATGTCATCTTCCTTTGCGGTCCTGATTT
TTTTCAGATTTGGATCATGTGGCCCTCATTGAATGAGTTGGGTGTGATGCCTTCTTTTTCATGTATCTGGA 
TTGATGGGACACTTTGGAGTCTCTCCAGATGGCCCTCAATGGTCCCTGCCTCCTCATTGTTAGGCTCCTAG 
GCAACCCTTTCTCATTTCTGGTAGGCCCAGGAACCTGTGGGTTTTATGTTTGTTTGTTTGTTTGTTTGTTT 
GTTTTTTGAGTTGGAGTCCTGCTTTGTCTCCCAGGCTGGGGTTGGAGTGCAATGGCCTGATCTCGGCCCAC
TGCAACCTCCACCTCCTGGGTTCAAGTGATTCTCCTGCCTCAGCCTTCTGTGTAGCTGGGATTACAGGCAT 
CCACCACCACTCCTGGCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTACAATATAGGCCATTGTGATC 
TCTTGGACAGGCTAGTCTCAAATTCCTGACCTCATGATCTGCCTGCCTCAGCCTCCCAAAGTGCTGAGATT 
ACAGTTTTGTGCCTCCACACACAGTGAATCTGTGGTTTTTAAAAGCTCCTCATGCATGTGAATTCTGTGAG 

CATCCCGGGATGACAGCCACTGTGTGTCCAGCTGTTAAAACTGTGAGAAAGCACCAGCGGGACCCTCTCCA
GCATTTGCTTGCTGTGGTCATGAAAGAGGCTTGTTGGGGAGATGATGCCCTGGTTGACTCCTGAAGGATGG 
TTAGGAATGCACCAGATGGAAGCTGGGTTGGACCCAGTCTATGCTAAAGAACAGCTTGTGTGGACACAAGG 
AGACACGAACACATCATTTTTGCAGAGCCTGGGGAGTAGCCAATCGCACCATTTGCTTAAAACACCGTGTA 
CAGTTGGAGAAGTGGACTGAGACAGGCTGAAGAAGCTAACAGTGGCCAGATGAGAAAGGGTCTTGTGTTAC 

TTCCTAGATATACTTAGATTTTATCCTGTGAGTGATAGGAACAGTTGCAGGGACTGAAGCCAAGGAAGCAT
GCTTTAAGATTCCATGTTTTTTGAGATGCTGTCTGGTGGCTGAGTAGGGAATTCCCTGGATAAGTACTGCC 
CAGGGTAGGCAAAAGAAGCTAGGAGGTTACTGAAATAAGGAGTATGAGAAATGGTGTAGGTTTTGCTGATG 
TTTTGTAACACATCTCATGACAATCTTCATTTCCTTCACCAATTTCCTGTTTCATTAATTCCCTTCCACGT 
GCTCTTCTGAAATTTGCCTCATATTCTTTGATTTCTCTTTTACATGTTGGTTTCATCACCTTTTACTTTTT 

GCTTTCCTGGAAACACAAATGATTCTGATTGTGACATGTCAGAATTATTTGCAACATTCCCCTTTCTGCTG
AAACATGAGCTCACTGAATACACAATTTAGTAAAGTGTAGGATGCACATGTTGTTTTCATGGTCATAACCA 
GCTCTGTAGCATTTTATAACTACACTGGCAGTGTGCTGGGAGGTGTAGAGAGAAATATTTATCTCATGTGT 
GGCTGACACAACCTGCCAAGTTGTTTTAGGAGCCTTCTTGGAATCCCAGCAAGAACACCACTGATGCAATT 
TGAAATCACAATGTCCTGCTCCATGCCCTGGCTTCATGGCTTAGTCACGTCTGAAGTCTATTTCTAACTAT 

CTGTTTCCACATCTATAAAGTATGAGTTAAATaATCCTAATACTACTCATCTTACAAAGTTTTCTTGCTGA

TATTAGGAGAGTTGGGAAAGAACTGTATAAATTATGAAGTGCCATGGAGATGTTGGTGGTTACTTTATCAA 
GAAATAGACACTCCAGAATAGAGTAGAAAGAAAACAGTTATGATTAAGTCCTCCTCCTCTTCTTTTTTTTT 
AATTTACAAAGAAAGGTTTAATTGAGTCACAGTTCCATATC 

GGCAGTTGGOU^AGTGGAAGAAGGCACCTTCTTCACAAGGTGGCAGGAGAGAGAGAGCTCCTCTTCTTTTT 

TGTTGTAAAGTCTACAGAAGTGCATATACTTCAGGGCAAGGGCAGGCAGGGAGAAGAAAGGACATTGCTTC

ACCCCAGTCCTCACTGACAAGTTTGCTTTGGGACTTCATTTTGTCCCAGCATATGGGACAGAGCTCTGGCC
ACTACCCATTCAGAAGGCCTGAGCTGCATTGCTAGTTCCCCACTAACTCTGTGTGTCCTTGGGCAAGGCTG 
GGCTTATGTCAAAAGATTATGACCCTGGGCTCTCCAGCTACAGAATCTACATATGAATCCTGGCTCTGCTA 
GAGCAATTAGTGACGTAACCTTGGATGGGTCAGTTAACCTTCCTGTGGCTTAGTTTGCTCATCTGTAAAAT 

AGGGATCATAACAACATCAATACCATGGGTTGTTAGACAGATTGAATCAGTTAATGCAGGGTA7VATACTTA

GCATGACACGTATTCACTATCATTTCCTTGAGTAAAAGCTGAGTGTGAGTGGGTGTG.AGAATGTGTGAAAC 
CCTTTCACTGCAATCTCAGTTAAGAAACCCATCCATAATTTAAAGTTCAGGGCCTAJLATGGGTGGTTATCT 
TCTCCCAGTTGCATCCTATCCCACCTTTGCTCTTCTCC.TGCCCGTAGGAGCTGTTGGTCTTTGATTGGGCT 
GGAAGACCTGGTGGACCCTAAGTGATCTATAAGAGAATGAGAATAGAGGACAGGGA^.TGTCTTCAAAACTC 

CTAGAGGGACACAGAGGCTGAGAGGCAGGCAGTCCTGCAGGGGTCTTCTGATTGGGACAAGGAGGACCTTG

GTCTTCATAGGCCAATTCTGGTCAATTTCCCCCATGGA^GATGAGGAAACAGATCCA.GGAATATCC^GG 
TCTCACACTTCCCATCTGTCAAGTCTTGTTGATTCTGTTGTATTCATGTCTTTCAJ^.GAGAGAGAGAGTTT 
AAGGAAAGAAAGAAGGATCAACTGTGTCTGATATCACTGGGAGCTTAAGTAAAGGGTTCTTTTACTTCATA 
GCATTTTTCCCAAT-TTGTAATTCAGTATTATTTTTGTCACTGTTTAGTATCTCTTTGTCCTATTAGAGAGA 

TAGCTTCATCAGGACAAGGATTTTGATTCTTTAATATTTAGTGCTTGCCACATGCCCTG.AACACAGCAGGC

ATACAGACTAACCAACATACAGTGGCATCGAAGTGAGACACCTACCTCCTCCAGTG-CTAGAGTACATGTC 
CATGGACCTGTCACTCTCCTCAACACCACCCCTAAGCATGAGGCCCGAAAGCATTGrT.^ATCCCCTCCTCC 
AGCCACCAAAACTTAAAGGCCAGGTGTGGTGGCTCCTATCTGAAATCTCAGAACTTrAGGAGACAGCAGCA 
GGAGGATCACTTGAGGCCAGGAATTTGAGACGAGCCTGGGCAACATAGCTAGACACCATCTGTACTAAAAA 

TTAGCTGGGCATGGTGGTATACCTGTAGTACCAGCTACTAAGGA.GGCTGAGGTAGGAGGATCACTTGAACC

CAGGAGGTGGAAGCTACAGTGAGCTATAACCACAGCACTGAACTCCAGCCTGAGCAACAGAGTGAGACCCT 
GCCTCAAAACAATTTCAAAAATAAATAAATAAAAACAAAACTTAGATACCACGTGG7CACCCCAACATGCA
AAATCAAGTTTTCCCCTACTGAGAAGAATGGGGACTTGAGAGCTGAGTTACAGAGAC-ATAATCTGCCTTTT 
TTTTTTTTTTTTTGGTTTACATCCTCAAGATCATGACCTGTGA^ATTTGAATCTA-.TACACAAATCATTCC 

AGAGCAATGTTGCTTCTGCCTACCACGAGTAATTCACTTGGCCACTGGAAGTCAGAACPAGCTTCCGAGAA

GAGAGGTACCACTTGGACTACCAATATAAAAGGATGAA7vATATCGGAGTGAAGGTGTTCCTTGCATCACTG 
AGTCCCTGGACAGCCTGTCCACTCATGCTGATATCTGAGCCTAATGCTTCTCTGA^.TGTTGAGATTTAACT 
TTGATCCAATGAAACCAGACCAAGAAAGAAGAAACGTCTTT 
AATTTTATGATTATTrcrCCTTAGCTO 

GTACAAAGTTGACCAAGACCAAGAATAATGTCTGGGAGCACAATACTGACAGCACAGCTTTAAAAACATGA
TGAATGCTTTAATACAGGAAATGAGTAGGGGAGAGGCAAGTGGTGCTTGGGTGTTCTTCCAATGCATAGTA
TCTTCCTTGACACAGTCAGTGCAGCTCTCAGTAGGCAAGTCCCTACATGTTAGAAGATGTTACTTTCTGTG 
GAATTAGGTGGCAGAACATGCCTTCAATTATTTTCCTTTGCAGAACAACACCAATTTCATTAGTTAGGACA 
GAGTGCTGGCTGCATTTGAATTCCAAGCAACGATTAGTCTATCACTGTTGGTATAGATTCCAACCAGTCAC 
ACCACCTCCTGAAGTTTGTTGGGCAGGTAAATCTTCATCTTAGAATAAAAATCATCTTAGCCAAGTAAGTG
TTTTAGAGGAAAGAAGAAAACATAATCGTTTCCATAAGAGTTTTGTTTCTAAAAAAATAAGAAAGGCTCTT 
TGTTTAGGTGAGCTAATGAAGTTGTTGATAGTTATCAGATGACACTGGAATCTTTACTTGCCAGAATGTGT 
TCTGTGCACCTCTCGGTGTGGCAACATAGAGAGGGAGATCCTCCAGCAATGCCATTGATATGGTCAGAAAC 
TGCATCTTTCTTTCTCCCTGCTGAGATGGGGTCCTTTGTTCTAGAAAACCCAGGGGGTGCCACTGGGAGTA 

ACCCTTGAGACAqGAACACGAATCTCAACCAATTTCTGGTTGCAGCCTTGAGTCTTACTATTTGCCATAGT
GATGCTTAGCAAGGAATGGCAGGTGCACCAGAGCAGCAGAGGACCTAATATCTCCCTTCCTGTTAACTTTT 
TATAATATTTTATTGTGATCAGTATCAGTTGGGAAGCTACTTGCAGTCACTGAGCCTCAGTTTCTACATCT 
GTAAACTGGGGATAGTAGCATGGCCCTATTTAATGTGCTCAGCGAAGCCACTGAAAGGAGACAGAAATGTA
CCAGAATTCCCTGGACTTTTATCCTACTTCTCCTGGGGATTGTCACCCACCTACCCGTGTCTGTCCTTTGT 

TGCTTTGACGCTGTCACTTCTTTTCTTAGGTACCTCTCTGTAGGGCTCCATTATTCCAGGGATTCCAGAGT
TACAGCACATGCATACCTCCATCCAAGCATGTTTATTTGTCTCCTGCTTCACTAGGCTGTCCCCAAGGAAC
ATGTGGCTCCCGGCACATACCTGGCACAACACTGCACATGACATTCACCCACTTGGCCTTGAATCTGACAA
GGAATCTGGCATGATGTTCACCTGCTGAGGCCAGGTGCCGAGCAGCCCTGGAGGCTTAGGGGCCAGAGGGA 
TGGGAAAAGGTGTCTTTCTGGGGTGAGTATCAGTTTCTGCAGGAGTGCTGAACCTGAGAAAGAATAAAGAG 

AGAAGGAAGTGAACAAGCACAGCTTAAACATCATCTGTTTCTACTGAGTTTTAACAACTCTGAGATTTTGT
TTGTCATGGAATCCATTTCTCAGGCCAAGCAGACAGAGAACTTGGGTGTGAGTGATGATAATGAGCTGATA 
TAATTTTCACACCCTCATCACTGAGATCTCTCCCATCAGGAATGGGTCACAGGGCTCACAGGTGGCAGCAA 
CTGTTATTACAGGCCTCATCTCTACCAGCTCCTGGCACCTGCTCTCCTCTCATTAGAAAATCCTCCACTTG 
TCAAAAAGGAAGCCATTTGCTTTGAATTCCAATTCCACCCTCAAGAGGCTGGGACCACCTCATTGGAGTCC 

TTGATGCTGTGTGACCTGCAGTGACCACTGCCCCATTGTTGCTGGCTGAGGTGGTTTGGGTCAACCTGGCC
ATCTGGGCAGCTGTTCTCTTCTCTTCTTTCTCCCCTACTGTTTCCAGACATGCAGTATTTCCAGAGAGAAG 

gggccactctttggcaaagaacctgtctaactttctatctacggcaggacttttga^agctacagaggaag 
aagcacaaattgatgctattccactaagccatcagctccatctcatccatgccatgtctcttttttagggg 
tcctcttgcczaacagaAtcacagaggacaaatctgaaagtgcagagagagcagctgaggcacagccaa 

ctctggctgtattaatgacctaagaagatggagtggtcacc^gaaagtcagaggaagtgacacacaggggc

ccagcaatctcagccaagtcaactccacgagcctttct 

accagagccatgagagtgagtaagaccagactatgcccttgaggagctcacctctgctaagggaaacaggc 
ctggaaacacacaatggtggtaaagaggaaagaagacaatagaactgcatgaaggggatggaaagtgccca 
ggggaggaaatggttacttctgtgtgagggggttggtgaggaaagactctaagaga-.ggctctgtctggct 
gggtatgaaaggatgtgtaggagtcttctagggggc^^

gcatggcttgttgggatgagtttcaagtattctggaatgaggacagccatagagacaagaggagagtttlaar' " " 
agattttatgccaatggctccacttgagtttgtgataagaacccagtvacccttggactccccagtaacatt 
gattgagttgtgtatgattctacatagaat/ittaactcaatggaggtcagtgagtggtgtgtgtgtgatta 
tttgccaactgccgaggtggagaagcctcttccgactgcaggcagagcacgggggccctgctactggctgc 

agctccagccctgcctccttctccagcatataaacaatccaacagcctcactgaatcactgctgtgcaggg

caggaaagctccacacacacagcccagcaaacagcagca 
Figure 9.

GCCAATTAGAAGAAACACAACTACAAGGTCAGGGCATATTATTCAAACAGTAGAGACAATACMTCAAATA 
TTTGGCAGAATTACAAAATATCTCATTGGAAAAGACACGCAAGGGAAATCAACAAAAAGATATGAATCAGA 

ATTCATCTGTGTCTCAAGAAAAGGTCATGCGATAAATTAAGTTCTGCTAGTGTTTCTACACTACCGTTAGC
CTCATTACCTTATTTTTTAAGTGTTAATATAGTTTTAGGTATTTTACATACATTTTTATTATTAATTACAA
CCAAAGTGCAACTTGTAATAGCAATTCCTTCACATTTTTTTTTTCAAATCTTGCACCTTAAAATCCACCTC 
GGGCCTCAGTTGGCCAGCTTTGGTATCTGATACTTGGACTACAGATACCACTAAGGCAAGTAGATAAAATG 
TACTCTAGGACCTACAGCCCTTCTGCTAGATCCTGAAGAATGATCATTAAAACAAGCTGGTCTAGCTGGTC 

AAGAGCAAAAATAAAATCAAGATGACAGAAAATTGATGCAAAAGTGAAGTAAAATAGCTAGAGAATATGAT
TGCGCCTGTCCCCTTAGCATGGATTCCCATGCTAGCCAATCTAAAATCCTCACTGTTAGAATCCTCCTGTC 
AATATGATAGAATGAACAGCAAGCTCAGTGTCAGAAAACCTGTGTTGTTAACTTGGCCCTCTTTCTAGCTG 
AATGTGTGTTTTTGGTCAAGTTCTTTGGCATTTCAGAGACTCAGAGTAGTGAAGGAAGTGGATAAGATGAC
CTCTACATTCTCTTGCAAGCTCAAACATCTATGAATCCAGAGAGAAAAACTAGAGCATGAAATTAAGGTTA 

TTTTAAAGAAATAACCTTAAAATTATTAGTATTCGAGGATCTCCAATATATTCATGGCACCACTCAAAACT
TTCCTTCTGCTCTATCCQGTCTTGGCTCAAAGTTATCTCCTTAATGAGGTCTGCCCTGACTATCCTACTTA 
AAATTGTAAACTTTGCCCACCTGGTACTTCCACTCTCTTTCCCCTGCTCTGTTTTTCACCGTA^TACTTTA 
CTCTTTTTAACATACAAAATCACTTATTTACTGTGTTGTTATCTATCTGCCTACTCTTACCATCAAATATA 
AGTTCTACCTAGGCAGGGATTTTTGTATGTTTTGCTCATGGATATATACGAAGCACTTAGAGTAATATGTG 

ACATATACAGGGTACTTGATTAATACTGTTGAGTGAATGAATGAGTTTCCAATACAAATTTAAAA.TAAAAT
ATTTCCTAACTTAAAATTGTAAAGTCAGATCTAACCAACTGTTCATTGGTCTGCTAGCAGTGTTTCTTGTA 
TATGGAAATATATTTTAAATAGATATGTCCTGTGAAATAATACTAAGTGTTCTAAAGAAATAAGTGAGTGA 
ACGTTACCTC^TTGAACTAACTTGACCTTGCTCCTGGGAGAGAGTTCATTTGAGATTAAACAAGTTCAAAG 
TCTATGAATCATAAAACGATAAAAAAAACTAAAAGGGAAATGGTGTTTTTATAAGCTCTGCAATTCAAAAG 

CCATTTCGGGTAATATTGTTATTTTTATGTCAGGAATTCCTCAGTGCTGATATCTTAGGGCAAAGGGTTTG
GTTATAAATTAAGAGAATGAGGAAATAGGTACATAGTAGGATTGTTCCAACCAAATATGTGTTGAATGTCA 
AAGGAATTTCCCTGAGGAATAATCTTCAGAATAATTTGCTAAGCACAGGAGAAAATTTGGCTTATTACTTT
ATAGCCAGATTTCATTTTTAATTGAAACTTCTTTCAAGCAAATCACTTACTAGTCTATTAACAATAACAAC 
ATAAACACAAGTAAACATTCGGAATATAGACATCCAGGTACTAAGCTGATTGCTTTACACTCACTGTCTTA 

TTTTACAAGTAAGGAGTTTTAGTTGCAGCAAAAGAAATAAATTTTCCAATGTCAAATGACCAG.^
CCCAATCTGTTTGGTGCTAAAGCCAATGTTCTTTACTGCAATGTTGGGTTATCTTGTTTCTAAAA.CTTAAA 
TTTATCAGTAAAAGGCAAAATTTGCTATTATTGAGGACATTAAAATCATATTTTTGTAGACTCTGAGGACA 
AATCCAACAAAAAAGTTCCAACTATTTCTTGGCAGGCATCATTGAAATTGGTATATAGCTTCCTTGGGTAT 
TGACTTTGAAAAGGAAGTTGGTCACTTTAGATATATAAGTTCAGTCTGTTTGTAAAAACAAAATGAAAACA 

AAACAGTTGCCTTATATGCTAAAATTATCCTAATCGTTTTCACCTTTAACAACATATACACACAGAACTTG
AGGAACTTTACACGGCTCATCTTCATATTGTCAGCATCTAGCAAAGTACTTGCCACAT^ 
AGTTTTAGCCAGCCTGGGCAACATAGTGACAGtCTATCTCTACAAAAAAAAATTAGCCAGGCAAGGTGGCG 
CACACCTTTGGTCCCAGCTACTTGAGAGGAGGATGTGGGGAGATCCTTTGAATGCAGGAGGTTGAGGCTGC 
AGTGAGCTGTGATTGCGCCACTGTGCTTCAACCTTGGTGACAGAGCAAGACCCTGTCTCACACACACACAA 

ATTAAGTAAAGAAAAAAAAAAGAATCAAAGAAAAAAATAATTCCCCAGCTTAAGTCC

GGATAAGCTATAAAGTGTCAAATAATGCTGTTAATGGACATTTCTCTAGCTCTCCCAAAGGAGGAATTiGAG 
CACATAGTATGTGCTGTATTTTATATAa^GAATAAAAATAGAGACAAGATTTCTACCCTCACAGAACTTAA 
ATTCTTCAGGAGAATGACACTGAAGTCCTTAATTGGACTTCTCTCTTCTGTATTATCTTCCTCAAGTGC3AG 
GTATATGGTGCTTAGTTATGAAAAATACCTCCAGGGCTTTGATCTTCTCAATAACTCTTTG7\GGCTGATAT 

GAAAACAGTAATTAGAAAAAAACCATGTATCCAACTTATATAGACAGTTGATGACCAAAGCTAGAATCCAG
TTATTTCAGCCTCCCATGTATTTTCTTATTAC r ITAAGGAGAATCTCTATCTCTACCTCTTTCTCTCTTCTT 
CCTCTCTCACTTTTCTTAGAAACATGGGTAAGA 

TACTGGGAATTCTCAATGTTTCTTGTTTTAGCCAGTTAATTTTGGCCTTCATTCAATGTGAGTGTCCCTTA 
ATAAGGAGCAAACTCCACTGAGAGATAGATACTAATAACCAGGATTCTGAAAATGCATTCTCATCCCCATC

TCCAAACTTTTATAAAAAATATTATAAAATAATACACTTTTAATATAGGAAATTTCTCAAATACAGAAAAA
ATTAAAGTAAACTCAGACCTAACTTTC7\TCACTAAAAGATAATCACTCTTGACACTTTGATATCTTTATCT 
CTAATATTACACCAATTAATTTGCTTGATATOGTGAATATTATGCTATTATAATTTTCCCCTGCCTTTTGT 
CCTTGCATATTAGCATAGGTATTTTCTGA.GGTTATCACAAACTCTGTAAGCACGTTTTATATTACTACTTT i- 
TTTAAAG/^GGATGTATAATAATTAATTCATCCATATATATGTGGTTAAGTATTCAGGTCACTGCTCATTTT 

5 T C A C TG T T AT A A A A T A 7v AG Cr n -C A AT G AA. T A C C T T T G G C TG AT7\T T T T TTT C TG T AC T T G G A ATT ATT T C T 
TTAAGATAGATTTCCTAAAATTGAATTACTGAGTC^^ 
TTCATTTGAATAATTTTCAACTCCTACTTC 

AAATATCTTTGCaK3TTGTAATTTGCATTTCTTTGTAGTTAGCATGAATATTTCAGTATGTTT^ 
TACCAGTATACTACATACTTTTl^ATATGAA 
GAATTTTTTCTTATCCATTTATATAAGCTCTTTATATATTAAGAATATTAACCTATTGTGATATTTGCAAT
AAATAGCTATATGGTTTGTTGTTGGTTTTAAATGTGAATTTATTCAATTTTCTTCAT^ 

TATAGATTTCTTTAAAAGTAATAAAATTATTGCCATATTATTATTTATTTTCAATGTCCATTACTAGCCCT 
TTCCAGTCATCACTTTTACTCTCATGTTCTTAATTTTTATTCATATCTTGGCTCCATTGACCATCTTTATT 
AGGATATTTGGGAAATAATACAATTTATACTAAACACACATCAAATCTTACCTTATTTTCTTCTTACAAAA
ACCCTAGGAATATGCTGTTTTTGTCTTTATTTGAATGACACAGAAATCAAGGTTTTTGAGCAGTGGAGTAT 
TTCTTCAAATGACACAGAAATCAAGTCTTTTGAGCAGTGGAGTATTTCTTCAAAGCCACACAGCTAGTAAG 
TCATGAAGCTGGAATTCCAAGAGTTGCCACTTCATTTTCTTCTTTCCCTTTATCTTACTCAGTTGTCTTCT 
CTCCTCTTAATTTTGTCATTCATTTAAAAACATTTCTTGTGCTATTATGGTAGATTTATTTTAATAGGGGG 

CAGTGACTTACTCAGAGAGATGATTCTCTAATGGAGTTTTAAAGATCTTAGAAGTTGATAGAGGAGGCTGG
GCGTGGTGGCTCATGCCTATAATCCCAGCACTTTGGGAGGCTGAGGTGGGTGGATCACTTGAGGCCAGAAG 
TTCAAGATCAGCCTAGTCAATGTGGTGAAACCCCATTTCTACTAAAATTACAAA^JVTTAGCCTGGTGTGCT 
GGTGCACTCCCATAATCCCAGCTACTCAGGAGGCTGAGGCATGAGAATTGCTGGAACCCAGGAGGCGGAGG
TTGCAGTGCGCCTGGATTATTAACACTACACTCCAGCCTGGGAGACAGAGTAATACTCCATCTCAATAAAA 

AGAAGTTGATAAGGGAGATAGTTCATGGCAACGGATCTTTGAAGGCACGCTAATGATAACTTAGGCATTTA
GCCTACTAGTGTAATTTCCATAAATCTGCCTCTGATGTCATACTCTCAGCACCTAATATTTTCTACAAACA 
TTTATTGAAACTT.TATTTTGTATAAGTCTCTGTCCAGTTTGAATATTTAAAAAATTCATAATCATATGAAA 
CATTAATAATAAATACAAAATGAGAGATGCCGATACTGAAAAGTAGGATTGCGGAGTGGTAGAAAATATTT 
CTGGCTGTAGTAGATGGGGAAGTGTTCAAAGAGGAGTATAATTCAGGTTTCCATTTGCCATCGACTTATCA 

CATGGCTAACTCACTAAGCGACTTAATTAAAATTAAATTAATTTATCATCATCTGATCACCATTTCACACA
ACTCATGTCTGTTGCTGTATTGGCTAAATGATGGCAAGACAAACGACCTCTGAAAATGATCCTATTGACCT 
TCGGAATCTGGATTTTTTTTTCAATGCAGGTGTCCATAGAAGCAATCTGATGTAATCCAACATGAGTTCAA 
GCACAGTCATTTAATATCCCCTATCAAGTACAGTCATTTAATATCCCCTATCATCACATGTCCTTCATACA 
TAAAAATCATTACATGTGAAAGGGTGGAGAGTGTGTGGATCCCTTATTATTGTGTTATTGTAACACAATAA 

CAATATTGTGTTATTATTGTTAACACAAGTGAGTCATATGTCTTGCTCTTTGGACTGAGTGGAAACTTGTA

TTCTTTGTCTGCCTCAGGTCAATTAACTTCATTGAGGTGAGTTGCATTCCTTCTTTAAGCATGTTGAACCT 
TCAATCTGGACTCAGATGGGCTAAATAGAGGAGCTAGGAAAAATACAGAAAATAAATTATTAGAGAGATCA 
GAGAAAGATACATAAGATTTACAATAAAAGAATTATGAGAAAAACATCCAAAAGAATTAAAAACCATAGGA 
GAAGGAAAAATAGGTGAACAGCTTTTTAATTTCTATAAATGTGTTGTTAATACTCATAATAAAGGACTCAG 

AGCTGGGATATGAGAATAATAGGTCAAACGTATATGGATACATAGATGTGACTACATA.CATGAGTTGCAAA

GAATGCTAAGGAGGGCAAAAAGAGATTGAGAAGAGGGCATTATTACTAATATATAGCAATGTTGAATGTTT 
AGGGTGTTTCAGGCACTGTACAAATCTTTTAAATACACAAATCACTTAATCTTGCCATAACATTAGAAGAT 
ACTATCTACTCTTTACCAAAAAGGTAACTGTGGGATAGCAGAGTTAAGTAACCCGTTCAAACCTATGCATA 
ATAATCAGCAGAGATGGTCCTTATCTAAGTTTTTCTGCCTTTGAAGTCCAAATAGTTTAATGCAGCCAGGT 

ACTAAAGAAGAAAACTTTTGTAAATTAGTTTAGTTTAATGATTTACATGTGGAAAAGCACAGAGTGAAAAG

CACATATCATCTGAGAAGCCCAGTGAGTTTGGCTGAAATGGAGTGAACATGTACATGTTGAGGGTGAGGAA 
GATGATTAGAGAGAAGTGATGTGTTGTGGTTCTTAAAAGCTAAGAGGAAACTGTTAGATATGATATAGTCT 
GTGGCAGGGAGCCATTGCAGGTTTTCCAAGATGGACTCATAAGGAGAAAACACTTCGTGGGACTGGAAAAG 
GCAGTGAAGTGGTGTGTCCTATGATAATGTACTACAGTGTAGGATAGTGGTTAAGAGTACAGTTATGAGAG 

AGGGACTACTGGTTAGCACCTTACCTGCTGTGTGACTGGGCAAATAATGCAAACCrCAGTGTCTTTTATTG

TAATATGGGAGTAACAAAAATAGTAACTACTTCATAGGATTCTTGTAAAGATTAJLATGACTTAATTTCTTT 
GAAGTGCTTGGCAGTTCCTGATAAATGACCA.GTAGTTAATAAATGTTAGTTGTTATTATTATCATTATATA 
TTATTACTCCCATAGATACATATAGAACAGACTGCAGCAGAGAGGCAAATCTTTAATGTTGTCAGAGTATA 
GACAAGTTGGTGAAATGGCTACATGAGAGCGGAGGACAAGAAGGTGCAGATTGTGGCAGTCACTTCAAATG 

GAAATATCACCGCTTGAATGAAGGf ATATGAGTGTCAACTTGCAAGGGGACCAGGTAGGTTTCATCAGAAA

TTAAGGAAGCTT7\AGGAGAACAGCCAAGTTCAGCTTGACAGAAGTGGTGGTGGCACAAJ"iTGCAAGACTGGT 
GTCTTTCAAGAAACCAAGGACTGTTGAAAGTAGC7\AGAGCTAGTTTGTTTTAGGTCCATCATGTTTTATAT 
TCACACTTTCATGTCAGTGGAGCAAAGAAATGGAATACAATATAATAGAATGGTAGAATCTTATTTTTAAA
ATCTGTGTTATTCTGATCTTTAACTTACTTATA.TCTTTGATAGAGA.TCTTTACCTGATGCTCAAGATTGTA 

GAAATAGTATAATCAACATAACAGTATAGCACTGTATTTATATCCTGCACTGTTTAGGGAGGGTTTAAGGC

C AT T C AAAAG G AT AC AT AAAAT A C AAC AAG AT T AC AT AAATG AAAG GTG AG AT AAAG C A r tC AAAG C AAAAC 
AAAAGTGAAAACAGAGATCATAGGCACAAATAAGATTAAAAACGCATGTA/vTGA^.GATGAAAGCTTTTA^ 
TTTACCCCAGATGGACCACAGGGTTGTTGTTAAGCCTTTAAACAGTGAACAATGCTGTACACTTGCATATG 
C AAT T A G A AC AT G TG G AAAAAAT AG TG G C C T G T T AG 7\AG C C T AAT T A AC AA T T T G T G AAA r ^ AAAAAAJ)AAA 

/lAJLAAAAAAAAGAGGCCGAGCTGTAGCTCACGCCTGTAATCCCTGCACTTTGGGAGGCCGAGGCGGGCGGA

TCACGAGGTCAGGAGATCAAGACCATCCTGGCTAACACAGTGAA/vCCCAGTCTCTACGAAAAATACAAAAA. 
ATTAGCCGGGCGTGGTGGCGGGAGCCTGTAGTCCCAGCTACCTGGGAGGCTGAGGCAGGAGAATGGTGTGA 
ACCCGGGAGGCGGAGCTTGCAGTGAGCCGAGATCCTGCCACTGCACTCCAGCCTGGGCGACAAAGCAAGAC 
TCCGTCTCAAAAAAGAAAAAAGAAAGAAAAACAA/iAGAAAACTTCATTGTATTGTA^GGCCAAGAACAAM 

TATATCAAGATAAGGAAAATTTGTAGTC7^G/\ATAGAAAAAAATTATGGCTTTGA^GTATGAGTTATTTAA
AGAAAGTGGAAACATCCTCAGACTATGCAGTAAAAAACAAAGTGATTTTCTTCTTCTAAACTTATGCAATA
AACTGATAGGTAATATGTGAAAGTCATAGAATGTAGACTAGAGGATACAACAAACCTATTTCCTCTATGTT 
CATAAGAAGTAAGAAAAGCTCTGATGTGAGTTAGCATTGCTTTACAATTTTGAATTGTGCAGATTGCACGT 
ACTTTTCCTCAGTTTGAAGTAAATAGTGGACAGGAAAAAATATTAAATGTTGGCAGTAAATATGGAAGGAA 
ATTACAACTAATGTAATATGCTAAAACATGCTATGTTTATTTTACTAATTTGAATTAAAATGTAAGAATTT
AAAATGCCCTGGAAAAACACGGGCATTGATCTGACGTCTGAAGTTTTAAAATATTACACACTTTGAAATAG 
CATTTGTACCTTGAAATACCTGTCTCTATATATTTTTTAAAACTTCCTTTTTCTTTCATTCCATTTATCAT 
CAAATAAAGGATGAACAGATGTAACTCAGAAACTGTCAAGCATGCTGAAGAAAGACaACTGCAGAAAAATT 
TCTCCTAGCCTXTTCAAAGGTGTTAGGAAGCAGAAAGGTGATACAGAATTGGAGAGGTCGGAGTTTTTGTA 

TTAACTGTATTAAATGCGAATCCCGAGAAAATTTCCCTTAACTACGTCCTGTAGTTATATGGATATGAAGA
CTTATGTGAACTTTGAAAGACGTGTCTACATAAGTTGAAATGTCCCCAATGATTCAGCTGATGCGCGTTTC
TCTACTTGCCCTTTCTAGAGAGGTGCAACGGAAGCCAGAACATTCCTCCTGGAAATTCAACCTGTTTCGCA
GTTTCTCGAGGAATCAGCATTCAGTCAATCCGGGCCGGGAGCAGTCATCTGTGGTGAGGCTGATTGGCTGG 
GCAGGAACAGCGCCGGGGCGTGGGCTGAGCACAGCCGCTTCGCTCTCTTTGCCACAGGAAGCCTGAGpTCA 

TTCGAGTAGCGGCTCTTCCAAGCTCAAAGAAGCAGAGGCCGCTGTTCGTTTCCTTTAGGTCTTTCCACTAA
AGTCGGAGTATCTTCTTCCAAAATTTCACGTCTTGGTGGCCGTTCCAAGGAGCGCGAGGTAGGGGCACGCA 
AAGCTGGGAGCTACTATGGGACAGTTCCCAAGTGTCAGGCTTTCAGATTTCCTGAACTTGGTCTTCACGGG 
AGAAGGGCTTCTTGAGGCGTGGATAGTGTGAAGTCCTCTGGCAAGTCCATGGGGACCAAGTGGGGTTAGAT
CTAGACTCAGGAGCTCCTGGAGCAGCGCCCAAACCGTAGTGGCACTGGACCATGTTGCCCGGAGCGCGCAC

AGCCCGCGCGGTGCGGGGACCTGCTCTCTGAGCCGGCGGGCGGTGGGTGGGAGGAAGCATCGTCCGCGGCG
ACTGGAACCGGGAGGGAGAATCGCACTGGCGGCGGGCAAAGTCCAGAACGCGCTGCCAGACCCCCAACTCT 
GCCTTCGTGGAGATGCTGGAGACCCCGCGCACAGGAAAGCCCCTGCAGTGCCCATCGCGGCCAGAGCAGCT 
GGGGCATCAACGGCGGGCGCTCCCTCTTACTGCTCTCTGGCTTCGACGGGGGACTAGAGGTTAGTCTCACC 
TCCAGCGCGCCTGAGGCTCATGCATTTGGCTAATGAGCTGCGGTTTCTCTTCAGGTCGC-AATGGATCTTGA 

AGGGGACCGCAATGGAGGAGCAAAGAAGAAGAACTTTTTTAAACTGAACAATAAA^.GGTAACTAGCTTGTT
TCATTTTCATAGTTTACATAGTTGCGAGATTTGAGTAATTTATTTCTAGCCTCCAGCTCTGAAATAAATGA 
CATGTTGTTGTTTTTAATTATTTTTAAGAAACGCAAGCTAGCCTTTG 
SEQUENCE LISTING

<110> THE UNIVERSITY OF SYDNEY 

<120> THE UNIVERSITY OF SYDNEY 

<130> P41173 

<160> 5 

<170> PatentIn version 3.1

<210> 1 

<211> 12983 

<212> DNA 

<213> Homo sapiens 

<400> 1

ctggttcatc tcattgggac tggttggaca agagggtgca gcccacggag ggtgagccaa 
60 

agcagggtgg ggcgtcgcct cacctgggaa gcacaagggg tcgtggaatt ttctccccta 
120 

cccaaggaaa gccataaggg actgagcctg aggaactgtg cactctggcc cagatactgc 
180 

acttttccca tggtctttgc aacccgcaga ccaggagatt ccctccggtg cctatgccac 
240 



cagggccctg ggtttcaagc acaaaactgg gcagccattt gggcagacac cgaactagct 
300 

gcaggagttt tttttttttt tttccatacc ccattggcac ctggaacgcc agtgagacag 
360 



aaccgttcac tcccctggaa agggggctga aaccagggat ccaagtggtc tggctcggtg 
420 

ggccccaccc ccatggagcc cagcaaacaa agattcactt ggcttgaaat tcttgctgcc 
480 

agcacagcag cagtctgaga ttgacctggg accctcgaac ttggttgggt gctgtggggg 
540 



ggcatcttcc attgctgagg cttgagtagg tggttttacc ttcgcggtgt aaacaaagct 
600 

gctgggaagt ttgaactggg tggagctcac cacagctcag taaggccact gtggccagac 
660



tgcctctctg gatttctcct ctctgggaag gatatctctg aaaaaaaggc agcagcccca
720

gtcagggact tatagatgaa acccccatct ccctgggaca gagcccctcg gggaagaggt
780 

ggcttccacc attgtggaag actgtgtggc aattcctcac ggatttagaa ctagaaatac 
840 

catttgaccc agcaatccca ttactgggtg tatacccata ggattataaa tcattctact 
900 

ataaagacac atgcacactt atgtttattg taacactatt tacaatagca ataacctgga 
960 

accaatccaa aagcccatca atgatagact gaataaagaa aatgtggcac atatacactg 
1020 

tggaatacta tgcagccata aaaaaggatg agttcatgtc ctttgcagag acatggatga 
1080 

agctggaaac catcattctc agcaaactag cacaataaca gaaaaccaaa cactgcatgt 
1140 

tgtcactcat aagtgggagt taaacaatga gaacacatgg acacagggag gggaacgtca 
1200 

cacactgggg catgtcgggg agtgggggcc tacgggaggg atagcattag cagaaatacc 
1260 

taatgtaggt gacgggttga tgggtgcagc aaaccaccat ggcacatata cacctatgta 
1320 

ataaaactgc acgttctgca catgtacccc agaacttaaa gtataattaa taataataat 
1380 

aatttctggg catgtaagta gctgtctttc aggttctact ttgatacata ttctgagaga
1440 

attaaacctg tcaaagaaac cttgactttc aatggcaggc actggaattg accctaataa 
1500 

tgtgttttgg ggtaagccta ctcatattct caacctgtct gcaatagtcg ttaaaatctg 
1560 

aacttcctga agttcatgtg caaagttgag ttaattgttt aatattcaac aaagattatg 
1620 

ccagtaagat ggtaggaaaa tattagatat gtgtcatcac toctggtatt atttaaactg 
1680 

caacatattt tagctggctg ctgatctcag ccaccatgcc tgcattttat ctctgtctcg
1740

tggtctgcaa ccttggaagc tttgaactta gctcatagaa tcctgggcat caagaacatg
1800

tggttctaat ggctagatag ggaatgagag taaaaggatt ttgcccacgg tcacgtgagt
1860

aaacaacaga tttggagggg tctggactac tgtgatgact tcattctgac aatatgttcc
1920

agttgtcctt tcatttcctc ctaatcacat gtctggtctg atctggctgt ttcccacctt
1980

ccaattcctg ccttctccaa tgctcccttc cgtaggtcac tctgtggctc agagaccctg
2040

cttagcaagc gcccaacctt tcaattattt gttcagtaaa acttgaactc atgtctcccc
2100

ttcttgataa aaagaaaata cgttatgtaa tgtcgggtta ctctataact cttgtcctgt
2160

ctctcggcaa ctactgaact aactgttttc atattgagca aacgtttatg gaaggactgc
2220

caagagtcag gtactaggct tggtaatatt ccccgttctc tctagtcaaa gccaacacca
2280

gccagacttg cagatctagg tcccaagccc actgcagatc acaggccagg gtctggtctc
2340

ctctgagctc ctttgggagg gaaagacaga attattaaca cccattttgt agattaggca
2400

actgaggctg aggaagttta aataactcag acagggcctg cacgtcagtc atattccaag
2460

gatccctact cactgtcttc tctctacaga acgagatgtc tctggagtcc atagaaagcc
2520

caggagcctg gctgggcacg gtggctcctg cctgtaatcc cagcactttg ggaggccgag
2580

gcaggcagat cacctgagct caggagttca agaccagcct gggcaacatc gcaaaacccc
2640

atctctacta aaaatacaaa aaattagctg ggcgtggtgg tgcatgcctc taatcccagc
2700

tacttgggag gctgaggcac aagaattgct tgagcccagg aggcagcagt tgcagtgagc
2760

tgagattgtg ccagtgcact ccagcctggg caacagagca agattccatt tcaaaaacaa
2820

aaacaaacac aaacaaacaa acaaaaatag aaagcccagg gaccacctgc gtcaggttcc
2880

cagccacacc tttttcttgt cctcctctgt ctctggcatc ttctcacagg ttcctaattg
2940

tttgtggttg cacaaattca aaatcccaga aaaattacca cttcacaccc actcagatgg
3000

ctattttttt tttgaaggaa gataacaagt gttgacaaga acatggagaa attggaattc
3060

tcacccattg ctggtgagaa tgtaatacgg tgctgctgct atggaaaaca gcttggagtt
3120

tcctcaaaaa gttcaacaga atttcaatgt gacccagcaa ttcccctcta agttatagat
3180

ctgagaggat taaaaacagt tactaaaata cacggactca catatttcta acagtccaat
3240

tcacaagggc caaaaggtgc taatagccca catgtccatc gatggatgga taaataaatt
3300

gtggtctatc catacaatgg aatattattc ggccataaat ggaatgaagt actgacgcat
3360

gctacagaat ggatgaaccg caaaaaaaat ggatgaacac atgctacaga atggatagcc
3420

tcactttact atgaagtg^a ggccagaaac gaagtccata tattgcatca tacaaaatat
3480

ccagaagagg gaagcccaca gagacagaat gtgcaatggt ggatgccagg gtctggggag
3540

aggggagagt ggggagaaac tgctcaactg gtacaggctt tattttggaa tgatgggaac
3600

attttgcaac tagatagagg tagtgattgc agaacacaga atgtactgaa ttccactgat
3660

ttttttcacc ttaaaatggt taattttcag tcctgagatt ggataatcat aaaaaaatgg
3720

ttaattttat gttatgtgaa tttcatccct atacatattt taaacctcag aaatatacac
3780

tagcaggcat ggaacaggtc actgtggtgc ctgccaagcc cggtgatgtt atctggggtc
3840



cccggccagc cttaagcctc ttgctgaccg gtggagggca gaacctttgc cctaaaagta
3900

taatatccac atgctggcat gattcctggc cagatggctt ctttattagc agtaattgaa
3960

actgcctcga tacagacact gtaccttgca accaaaaaat gactcaacaa tgataataag
4020

ggttaagctg ggcctttctc tctttgccag ttaaattata tttattatag cttgacatga
4080

aaaacaaagc aactccaaca ggtatcacaa gggcaaagga catgaacatt ttatcaaaga
4140

agaaatgcag ctgtc&aaaa tacagaaata ttcaaccttg ttcataataa agtggctggg
4200

ctcagtggtt catgcctgta atcccagtgc tttgcaaggc tgagacagga ggatcatttg
4260

aagccagaag ttcaagacca tcctaggcaa gtcagttcaa taccagactt catgtctaca
4320

aaacatcaaa aaattagcca ggcatggtga tgcatgcctg ttgtcccagc tactcaggag
4380

gctgaggcag gagaattgct tgagcctggg aggctgcggt ggcggtgagc catgattgtg
4440

ccattgtact ccagcctggg caatgcagca agactgtcta aataacaaaa ataatagtaa
4500

agaaaaggat tgggatgcca tttacttgcg tattcaatac acagagttaa aagtaatttc
4560

tacgttttct atttttttat tactaaaaaa agctggacca ttctcacagc ctgaaatgct
4620

tctcactttc ccttcttctg tccaaacact tctctatgat aatgcaaaca gtcactcctt
4680

taggaagact tcaccccagg tagttccaga tccccttatc tctgccttcc cagaactcct
4740

ggtgtctctc cagttccctc cgtgtggtga agtaccctac ctagggtttc agtatggctc
4800

tgtctgcaaa ggtcttgttc acaccttccc ttatggttct gttgccctgt cttgtgtcat
4860

agcacagggc acagtggaga acccattcac actgatagag agggccccat ggtcctggag
4920 

ataaccatgt aaccgatcag aataaggcat tgagggctgg gtgtcaggcg tgggctgcac 
4980

ttgggtgggc aggtcccctg gaaagtcact gggtttggca agcttcctag taacatgtct 
5040 

ctctggggtc ccccttggaa cttcatgcaa aaatgctggt tgctggttta ttctagagag 
5100 

atggttcatt cctttcattt gattatcaaa gaaactcatg tcccaattaa aggtcataaa 
5160 

gcccagtttg taaactgaga tgatctcagc tgaatgaact tgctgaccct ctgctttcct 
5220

ccagcctctc ggtgcccttg aaatcatgtc ggttcaagca gcctcatgag gcattacaaa 
5280 

gtttaattat ttcagtgatt attaaacctt gtcctgtgtt gaccccaggt gaatcacaag 
5340 

ctgaacttct gacaagaaca agctatcata ttcttttcaa ttacagaaaa aagtaagtta 
5400 

attgatagga ttttttttgt ttaaaaaaaa tgttactagt tttgaaaagg taatatgtgc 
5460 

acatggtaaa cactaagaag gtataagagc ataatgcttt tatactacta agaataatgt 
5520 

tttctctaag ttttttttgg tagatgcttt catcagatta agaaaattcc ctgctattag 
5580 

ttgttgaagg tttttatatc ataaatgaaa gttgaatatt attatcatat attattaata 
5640 

tattgttatt gaactatcaa agccttttcc taaaaccatt gagatgatct tataaccatt 
5700 

ctcctttaac ctgttgacga gatcattggt atttatacta tttctctgtt aaccattctt 
5760 

gagtctcagg tttaaattca acttggtcat ggtgtgtcat ctttgatcat tgctgtctgt 
5820 

ggcttgctac tgttttgttt aggatttttg cactgatgct catcaatgag actggcatgc 
5880 
catcttcctt tgcagtcctg atttttttct gatttggatc atgtggttat ggccctcatg
5940 

gaatgagttg ggcatgatgc ctttttttca tgtctctgga ttgatgggac actttggatt 
6000 

ctctccagat ggccctcaat ggtccctgcc tcctcattgt taggccccta ggcaagccct 
6060 

tctcatttct ggtaggccca ggaacctgtg ggggttttgt ttgtttgttt gtttcttgag 
6120 

tcggagtctc actctgtcac ccaggctgga gttggagtgc aatggcccga tcttggctca 
6180 

ctgcaacctc cacctcccag attcaagcaa ttctcctgcc tcagcctcct gagtagctgg 
6240 

aattacaggc acccaccgac acaccctgct aatttttgta tttttagtac agatggggtt 
6300 

tcacaatatt ggccaagctg gtctcgaact cctgatctca tgatctgccc agcttggcct 
6360 

cccaaagtgt tgagattaca agcatgagcc accacaccca gtgaacctgt ggtttttaga 
6420 

agctccccat gcatgtgaat gctgtgagca tcccaggatg acagccactg tgtgttcagc 
6480 

tgttggaact gtgagaaagc accagtggga ccttctccag cacctgccta ctgagttcat 
6540 

ggaagaggct tgttggggag atgatgccct ggctgactcc tgaaggatgg ttaggaatgc 
6600 

accagatgga agctgggttg gacccactct atgctgaaga acagcttgta tagacacaag 
6660 

gagacacgga tatgtcattt ttgtagagcc tgaggagtgt ccaatcacac catttgctta 
6720 

aaacatcatg cacacttgga aaagtggact gagaccgaat gaagaagcta acagtggcca 
6780 

gatcagaaag ggtcttgtgt tacttcctag agatacttag attttatcct gtgggtgata
6840 

ggagcagttg gagggactga agacaaggaa agaaacatgt ttcaagatct atgtttttca 
6900 

agacgctttt ctggtggctg agtagggaat tccctggata agtcctgccc agggtcaggc
6960

aaaacaagtt agggggttac tgaaataagg agtatgagaa atggtgtagg ttgtgctgac 
7020 

gttttgtaac acatctcatg atgatcttca tttccttcac taatttcctg tttcattaat 
7080 

tcccttccac gtgctcttct gaaatttgcc tcacattctc tgatttctct tttacctgtt 
7140 

ggtttcatca ccttttactt tttgctttcc tggaaacaca aatgattctg attgtgacat 
7200 

gtcagaatta tttgcaacat ttgcctttct gctgaaacca tgagttcact gaatacacaa 
7260 

tttagtaaag tgtaggatgc acatgtcgtt ttcgtggtca caaccagctc tgtagcattt 
7320 

tataactaca ctggcagtgt gctgggaggt gtagagagaa atatttatca catgtgtggc 
7380 

tgacacaacc tgccaagtta ttttaggagc ctccttggaa tcccagcaag aatgctaccg 
7440 

gcacaatttg taatcacagc atcctgctcc atgccttggc ttcatggcat agtcacttct 
7500

gcaagtctct ttccagctgt ctgttcccat gtctataaag tatgagttaa atcatcctaa 
7560 

cactactcat cttacaaagt tttcttgctg atgttaagag agttgggaaa gaactgtata 
7620 

aactgtgaag tgccatggag atgttagtgg ttactttatc aagaaataca cactctagaa 
7680 

tggagtagaa agccaacagt tatgattgag tcctcctcct cttcttcttt ttattaattt 
7740 

ataaagaaaa gaggtttaat tgactcacag ttccatatgg ctggggaacc ctcggaaaac
7800 

tctcagtcat agcaggaggc aaaggggaag aaggcacctt cttcacaacc cagcagaaga 
7860 

gagagagctc ctgttctttt ttgtcataaa gtctacagaa gtgcttatac ttcaggacaa 
7920 

gggcaggcag agagaaggaa ggacattgct tcaccccagc cctcactaac aagtttgcta
7980

ggggacctca ctttgtccca gagtagggca gaactctggc cactacccat tcagaaggcc
8040 

tgggctgcac tgctagttcc tcactaactc tgtgtggcct tgggcaaggt tgggcctgtg 
8100 

ttaacagatt atgaccctgg gctctcaagc tagaggatct aaatttgaat cctggctctg 
8160 



ctaaagcaat tagtgatgta aactttaatg ggtcagttaa ccttcctgtg gcttagtttg 
8220 

ctcatctgta aaatagggat cataacagta tcaataccac atgattgttg gacagattga 
8280 

atcagttaat gcaggggaag tacttagcat gacacgtatt cactatcatt tcctggagta 
8340

agagctgtgt gtgagtgggt gtgagcatgt gtgaaacctt ttctctgcaa tctcagttaa 
8400 

gaaaccaatc cagaatttaa agttcagggc ctaaatgggt ggttatcttc tcccagttcc 
8460 

atcctatccc acctttgctc ttcctcccgc ccacaggagc tgttggtcct tgattgggct 
8520 

ggaagacctg gtggacccta agtgatctat aagaggagaa tagagaacag cgaatgtctt 
8580 



caaaaatcta gagggacaca gaggctgaga ggcaggcagt cctgcagggt cttctgattg 
8640 



ggacaaggag aaccttggtc ttcacaggcc aattctggtc agtttccccc atggacagat 
8700 



gaggaaacag gcccaggaat atccaaggtc tcacacttcc catctgtcaa ctcttgttga 
8760 



ttctgttgta ttcatgtctc tcaaagggag atagagttta gggaagaaag aaggatcaac 
8820 



tgtgtctgat accactggga gcttaagtaa agggttcttt tacttcatag catttatccc 
8880 



aatttgtaat tcagtattat ttgtgtggct gtttggtgtc tctttctcct atatgagtgc 
8940 



tagcttcata agggcaagga ttttgattct ttaatattta gtgcttgcca catgccctga 
9000 
acacagcagg catacaggct aaccaacata cagtggcatg aaagtcatga aagtgagaca
9060 

cctacctcct ccagtgccaa gagagcataa ccatgcacct gtcactctcc tcaacaccac 
9120 

ccccaagcat gaggcccaaa agcattagct aatcccctcc tccagccact aaaacttaaa 
9180 

ggccaggtgt ggtggctccc atctgaaatc ccagaacttc aggagacagc agcaggagga 
9240

tcacttgagg ccaggagttt gagatcagcc tgggcaacat agctaggtcc catctgtact 
9300 

aaaaattagc tgggcgttgt tgcatgcctg tagtcccagc tactaaggag gctgaggtgg 
9360 

gaggatcact tgagcccagg aggtggaaac aacagtaagc tataatcaca gcactgaact 
9420 

ctagcctggg caacagagtg acaccctgcc tcaaaacaat tttaaaaata aataagagca 
9480 

aaacttagat accacgtggt caccccaaca tgcaaaatca agttttcccc tactgagaag 
9540 

aatggggact tgacagctga gttacagaga gataatcttc ttcttctttt tttttttttg 
9600 

gtttacatcc tcaagatcat gacttgtgaa atttgaatcg aatacacatg taattccaga 
9660 

gcaatgttgc ctccgcatac catcagcaat tcacttggct actggaagtc aggataagct 
9720 

tcccagaaga gaggtaccac ttgggctacc aatataaaag gatgaaaata tcagagtgat 
9780 

ggtgttcttt acaacgttga gtccctggac agcctgtcca ctgatgctga tatctgagcc 
9840 

taatgcttct ctgaatgttg agattgaact ttgatccaat gaaactaaaa cgagaaagaa 
9900 

gataagtctt tcattgttga taaggacatt atgtttctca tacttgtatg attatttttc 
9960 

cttagctgta ctataattat ctgcttattt gtctctgctc tatgtgctta aggtacaaag 
10020 

ttgaccaaga ccaactttgg ttggaagcat agtactaaga gcacagtact gagagcacag
10080

tattgagagc acagctttaa aaaacatgat gaaggcttta atacaggaaa tgagcagggg
10140

agaggcatgt ggtggttgga tgtatcttcc ttgacacagt cagtgcagct ctcagtagtc
10200

aagtccctac atgttagaag atgttacctt ctgtggaatt aagtggcaga acttgccttc
10260

aattattttc ctttgcagaa caacaccaac tgcattagtt aggacacagt gctggctgca
10320

tttaagtccc aagcgatgat tagtctctca ctgttggtat agattcaaac caatcagacc
10380

acctcctaaa gtttgtaggg caggtaaatc ctcatcttag aataaaaatc atcttaccaa
10440

gtatgtgttt tagaggcaag aagaaaacat atttgtttct gtaagagttt tgtttaaaaa
10500

aaatataaga aaggctctcg gtttaggtga ggtaatgaag ttgttgatag ttatcagatg
10560

acactggaat ctttacttct ctgaacgtgt tctgtgcatc tctcagtgtg ggaacataga
10620

gagggagatc ctccagcaat gccactgata tggtcagaaa ctgcatcttt ctttctccct
10680

gctgagatga gatggagtcc tttgttctag aagacccatg gtggtgccgc tgggagtaac
10740

ccttgagaca ggaacacaaa tcccaaccaa tttgtggttg cagccttgag tctcactatt
10800

tcccatagtg atgcgtagca gggaatggca ggtgcaccag agcaggagag gacctaatat
10860

ctcccttcct gttagctttt tataaagttt tattgtgatc agtagcagtt gggaagctac
10920

ttgcagtcac tgagcctcag tttctacatc tgtaaactgg ggatagtagc atggccccta
10980

cttaatgtgc tcagcaaagc cactgaaagg agacagaaat gtatctaaat taccctggac
11040

ttttatccta cctctcttgg ggattgtcac caccttccca tgtttgtcct ttttggtttg
11100

atgcttgctg tcacttcttt ccttaggtgc ctctctgtac ggctctttta tcccagggat
11160

tccagagtta cagcacatgc ataccaccat ccaagcatgt ttatttgtct cctgcttcac
11220

taggctgtcc ccaaggaaca tgtggctccc ggcacacacc tggcacaaca ctgcacatga
11280

cattcaccca cttggccttg aatctgacaa ggaatctggc atgatgttca cccactcagg
11340

ccaggtgccg agcagccctg gaggcttagg ggccagaggg atgggaaaag gtgtctttct
11400

ggggtgagta tcagtttctg caggagggct gaatgtgaga aagaataaag agagaaggaa
11460

gcgaacaagc acagcttaaa catcgcctat ttctattgag ttttaagaac gctgtgattt
11520

tgtttgtcat gcaatccatt catcaggcca ggcagacaca gaacttgggt gtgagtgacg
11580

ataatgagct gatataattt tcacaccctc atcactgaga tctctcccat caggaatggg
11640

tcagggagct cacaggtggc agcaactgct attacaggcc tcatctctac cagctcctgg
11700

ggcctgccct cctcccatta gaaaatcctc cacttgtcaa aaaggaagcc atttgctttg
11760

aactccaatt ccacccccaa gaggctggga ccatcttact ggagtccttg atgctgtgtg
11820

acctgcagtg accactgccc catcattgct ggctgaggtg gttggggtcc atctggctat
11880

ctgggcagct gttctcttct ctcctttctc tcctgtttcc agacatgcag tatttccaga
11940

gagaaggggc cactctttgg caaagaacct gtctaacttg ctatctatgg caggaccttt
12000

gaagggttca caggaagcag cacaaattga tactattcca ccaagccatc agctccatct
12060

catccatgcc ctgtctctcc tttaggggtc cccttgccaa cagaatcaca gaggaccagc
12120

ctgaaagtgc agagacagca gctgaggcac agccaagagc tctggctgta ttaatgacct
12180 

aagaagtcac cagaaagtca gaagggatga catgcagagg cccagcaatc tcagctaagt 
12240 

caactccacc agcctttcta gttgcccact gtgtgtacag caccctggta gggaccagag 
12300 

ccatgacagg gaataagact agactatgcc cttgaggagc tcacctctgt tcagggaaac 
12360 

aggcgtggaa acacaatggt ggtaaagagg aaagaggaca ataggattgc atgaagggga 
12420 

tggaaggtgc ccaggggagg aaatggttac atctgtgtga ggagtttggt gaggaaagac 
12480 

tctaagagaa ggctctgtct gtctgggttt ggaaggatgt gtaggagtct tctagggggc 
12540 

acaggcacac tccaggcata ggtaaagatc tgtaggtgtg gcttgttgag atgaatttca 
12600 

agtattttgg aatgaggaca gccatagaga caagggcaag agagaggcga tttaatagat 
12660 

tttatgccaa tggctccact tgagtttctg ataagaaccc agaaccctta gactccccag 
12720 

taacattgat tgagttgttt atgatacctc atagaatatg aactcaaagg aggtcagtga 
12780 

gtggtgtgtg tgtgattctt tgccaacttc caaggtggag aagcctcttc caactgcagg 
12840 

cagagcacag gtggccctgc tactggctgc agctccagcc ctgcctcctt ctctagcata 
12900 

taaacaatcc aacagcctca ctgaatcact gctgtgcagg gcaggaaagc tccatgcaca 
12960 

tagcccagca aagagcaaca cag 
12983 



<210> 2 

<211> 9715 

<212> DNA

<213> Homo sapiens 

<400> 2 
ctggttcatc tcattgggac tggttggaca agagggtgca gcccacggag ggtgagccaa
60 

agcagggtgg ggcgtcgcct cacctgggaa gcacaagggg tcgtggaatt ttctccccta 
120 

cccaaggaaa gccataaggg actgagcctg aggaactgtg cactctggcc cagatactgc 
180 

acttttccca tggtctttgc aacccgcaga ccaggagatt ccctccggtg cctatgccac 
240 

cagggccctg ggtttcaagc acaaaactgg gcagccattt gggcagacac cgaactagct 
300 

gcaggagttt tttttttttt tttccatacc ccattggcac ctggaacgcc agtgagacag 
360 

aaccgttcac tcccctggaa agggggctga aaccagggat ccaagtggtc tggctcggtg 
420 

ggccccaccc ccatggagcc cagcaaacaa agattcactt ggcttgaaat tcttgctgcc 
480 

agcacagcag cagtctgaga ttgacctggg accctcgaac ttggttgggt gctgtggggg 
540 

ggcatcttcc attgctgagg cttgagtagg tggttttacc ttcgcggtgt aaacaaagct 
600 

gctgggaagt ttgaactggg tggagctcac cacagctcag taaggccact gtggccagac 
660 

tgcctctctg gatttctcct ctctgggaag gatatctctg aaaaaaaggc agcagcccca 
720 

gtcagggact tatagatgaa acccccatct ccctgggaca gagcccctcg gggaagaggt 
780 

ggcttccacc attgtggaag actgtgtggc aattcctcac ggatttagaa ctagagatac 
840 

catttgaccc agcaatccca ttactgggtg tatacccata agattataaa tcattctact 
900 

ataaagacac atgcacactt atgtttattg taacactatt tacaatagca atgacctgga 
960 

accaatccaa aagcccatca atgatagact gaataaagaa aatgtggcac atatacactg
1020 

tggaatacta tgcagccata aaaaaggatg agttcatgtc ctttgcagag acatggatga
1080

agctggaaac catcattctc agcaaactag cacaataaca gaaaaccaaa cactgcatgt 
1140 

tgtcactcat aagtgggagt taaacaatga gaacacatgg acacagggag cggaacgtca 
1200 

cacactgggg catgtcgggg agtgggggcc tacgggaggg atagcattag cagaaatacc 
1260 

taatgtaggt gacgggttga tgggtgcagc aaaccaccat ggcacatata cacctatgta 
1320 

ataaaactgc acgttctgca catgtacccc agaacttaaa gtataattaa taataataat 
1380 

aatttctggg catgtaagta gctgtctttc aggttctact ttgatacata ttctgagaga 
1440 

attaaacctg tcaaagaaac cttgactttc aatggcaggc actggaatta accctaataa 
1500 

tgtgttttgg ggtaagccta ctcatattct caacctgtct gcagtagtcg ttagaatctg 
1560 

aacttcctga agttcatgtg caaagttgag ttaattgttt aatattcaac aaggattatg 
1620 

ccagtaagat ggtaggaaaa tattagatat gtgtcatcac tgctggtatt atttaaactg 
1680 

caacatattt tagctggctg ctgatctcag ccaccatgcc tgcattttat ctctgtctcg 
1740 

tggtctgcaa ccttggaagc tttgaactta gctcatagaa tcctgggcat caagaacatg
1800 

tggttctaat ggctagatag ggaatgagag taaaaggatt ttgcccacgg tcacgtgagt 
1860 

aaacaacaga tttggagggg tctggactac tgtgatgact tcattctgac aatatgttcc 
1920 

agttgtcctt tcatttcctc ctaatcacat gtctggtcta atctggctat ttcccacctt 
1980 

ccaattcctg ccttctccaa tgctcccttc cgtaggtcac tctgtggctc agagaccctg 
2040 

cttagcaagc gcccaacctt tcaattattt gttcagtaaa acttgaactc atgtctcccc
2100 

ttcttgataa aaagaaaata cgttatgtaa tgtcgggtta ctctataact cttgtcctgt
2160

ctctcggcaa ctactgaact aactgttttc atattgagca aacgtttatg gaaggactgc
2220

caagagtcag gtactaggct tggtaatatt ccccgttctc tctagtcaaa gccaacacca
2280

gccagacttg cagatctagg tcccaagccc actgcagatc acaggccagg gtctggtctc
2340

ctctgagctc ctttgggagg gaaagacaga attattaaca cccattttgt agattaggca
2400

actgaggctg aggaagttta aataactcag acagggcctg cacgtcagtc atattccaag
2460

gatccctact cactgtcttc tctctacaga acgagatgtc tctggagtcc atagaaagcc
2520

caggagcctg gctgggcacg gtggctcctg cctgtaatcc cagcactttg ggaggccgag
2580

gcaggcagat cacctgagct caggagttca agaccagcct gggcaacatg gcaaaacccc
2640

atctctacta aaaatacaaa aaattagctg ggcgtggtgg tgcatgcctc taatcccagc
2700

tacttgggag gctgaggcac aagaattgct tgagcccagg aggcagcagt tgcagtgagc
2760

tgagattgtg ccagtgcact ccagcctggg caacagagca agattccatt tcaaaaacaa
2820

aaacaaacac aaacaaacaa acaaaaatag aaagcccagg gaccacctgc gtcaggttcc
2880

cagccacacc tttttcttgt cctcctctgt ctctggcatc ttctcacagg ttcctaattg
2940

tttgtggttg cacaaattca aaatcccaga aaaattacca cttcacaccc actcagatgg
3000

ctattttttt tttgaaggaa gataacaagt gttgacaaga acatggagaa attggaattc
3060

tcacccattg ctggtgagaa tgtaatacgg tgctgctgct atggaaaaca gcttggagtt
3120

tcctcaaaaa gttcaacaga atttcaatgt gacccagcaa ttcccctcta agttatagat
3180

ctgagaggat taaaaacagt tactaaaata cacggactca catatttcta acagtccaat
3240

tcacaagggc caaaaggtgc taatagccca catgtccatc gatggatgga taaataaatt
3300

gtggtctatc catacaatgg aatattattc ggccataaat ggaatgaagt actgacgcat
3360

gctacagaat ggatgaaccg caaaaaaaat ggatgaacac atgctacaga atggatagcc
3420

tcactttact atgaagtgaa ggccagaaac gaagtccata tattgcatca tacaaaatat
3480

ccagaagagg gaagcccaca gagacagaat gtgcaatggt ggatgccagg gtctggggag
3540

aggggagagt ggggagaaac tgctcaactg gtacaggctt tattttggaa tgatgggaac
3600

attttgcaac tagatagagg tagtgattgc agaacacaga atgtactgaa ttccactgat
3660

ttttttcacc ttaaaatggt taattttcag tcctgagatt ggataatcat aaaaaaatgg
3720

ttaattttat gttatgtgaa tttcatccct atacatattt taaacctcag aaatatacac
3780

tagcaggcat ggaacaggtc actgtggtgc ctgccaagcc cggtgatgtt atctggggtc
3840

cccggccagc cttaagcctc ttgctgaccg gtggagggca gaacctttgc cctaaaagta
3900

taatatccac atgctggcat gattcctggc cagatggctt ctttattagc agtaattgaa
3960

actgcctcga tacagacact gtaccttgca accaaaaaat gactcaacaa tgataataag
4020

ggttaagctg ggcctttctc tctttgccag ttaaattata tttattatag cttgacatga
4080

aaaacaaagc aactccaaca ggtatcacaa gggcaaagga catgaacatt ttatcaaaga
4140

agaaatgcag ctgtcaaaaa tacagaaata ttcaaccttg ttcataataa agtggctggg
4200

ctcagtggtt catgcctgta atcccagtgc tttgcaaggc tgagacagga gaatcatttg 
4260 

aagccagaag ttcaagacca tcctaggcaa gtcagttcaa taccagactt catgtctaca 
4320 

aaacatcaaa aaattagcca ggcatggtga tgcatgcctg ttgtcccagc tactcaggag 
4380 

gctgaggcag gagaattgct tgagcctggg aggctgcggt ggcggtgagc catgattgtg 
4440 

ccattgtact ccagcctggg caatgcagca agactgtcta aataacaaaa ataatagtaa 
4500 

agaaaaggat tgggatgcca tttacttgcg tattcaatac acagagttaa aagtaatttc 
4560 

tacgttttct atttttttat tactaaaaaa agctggacca ttctcacagc ctgaaatgct 
4620 

tctcactttc ccttcttctg tccaaacact tctctatgat aatgcaaaca ctcactcctt 
4680 

taggaagact tcaccccagg tagttccaga tccccttatc tctgccttcc cagaactcct 
4740

ggtgtctctc cagttccctc cgtgtggtga agtaccctac ctagggtttc agtatggctc 
4800 

tgtctgcaaa ggtcttgttc acaccttccc ttatggttct gttgccctgt gttgtgtcat 
4860 

agcacagggc acagtggaga acccattcac actgatagag agggccccat agtcctggag 
4920 

ataaccatgt aaccgatcag aataaggcat tgagggctgg gtgtcaggca tgggctgcac 
4980 

ttgggtgggc aggtcccctg gaaagtcact gggtttggca agcttcctac taacatgtct 
5040 

ctctggggtc ccccttggaa cttcatgcaa aaatgctggt tgctggttta ttctagagag 
5100 

atggttcatt cctttcattt gattatcaaa gaaactcatg tcccaattaa agatcataaa 
5160 

gcccagtttg taaactgaga tgatctcagc tgaatgaact tgctgaccct ctgctttcct 
5220 
ccagcctctc ggtgcccttg aaatcatgtc ggttcaagca gcctcatgag ccattacaaa
5280 



gtttaattat ttcagtgatt attaaacctt gtcctgtgtt gaccccaggt aaatcacaag 
5340 



ctgaacttct gacaagaaca agctatcata ttcttttcaa ttacagaaaa aagtaagtta 
5400 



attgatagga ttttttttgt ttaaaaaaaa tgttactagt tttgaaaagc taatatgtgc 
5460



acatggtaaa cactaagaag gtataagagc ataatgcttt tatactacta agaataatgt 
5520 



tttctctaag ttttttttgg tagatgcttt catcagatta agaaaattcc ctgctattag 
5580



ttgttgaagg tttttatatc ataaatgaaa gttgaatatt attatcatat attattaata 
5640 



tattgttatt gaactatcaa agccttttcc taaaaccatt gagatgatct tataaccatt 
5700 



ctcctttaac ctgttgacga gatcattggt atttatacta tttctctgtt aaccattctt 
5760 



gagtctcagg tttaaattca acttggtcat ggtgtgtcat ctttgatcat tgctgtctgt 
5820 

ggcttgctac tgttttgttt aggatttttg cactgatgct catcaataag actggcatgc 
5880 

catcttcctt tgcagtcctg atttttttct gatttggatc atgtggttat ggccctcatg 
5940 

gaatgagttg ggcatgatgc ctttttttca tgtctctgga ttgatgggac actttggatt 
6000 

ctctccagat ggccctcaat ggtccctgcc tcctcattgt taggcccctg ggcaagccct
6060 

tctcatttct ggtaggccca ggaacctgtg ggggttttgt ttgtttgttt gtttcttgag
6120 



tcggagtctc actctgtcac ccaggctgga gttggagtgc aatggcccca tcttggctca 
6180 



ctgcaacctc cacctcccag attcaagcaa ttctcctgcc tcagcctcct gagtagctgg
6240

aattacaggc acccaccgac acaccctgct aatttttgta tttttagtac agatggggtt
6300 

tcacaatatt ggccaagctg gtctcgaact cctgatctca tgatctgccc ggcttggcct 
6360 

cccaaagtgt tgagattaca agcatgagcc accacaccca gtgaacctgt ggtttttaga 
6420 

agctccccat gcatgtgaat gctgtgagca tcccaggatg acagccactg tgtgttcagc 
6480 

tgttggaact gtgagaaagc accagtggga ccttctccag cacctgcctg ctgagttcat 
6540 

ggaagaggct tgttggggag atgatgccct ggctgactcc tgaaggatgg ttaggaatgc 
6600 

accagatgga agctgggttg gacccactct atgctgaaga acagcttgtg tggacacaag 
6660 

gagacacgga tatgtcattt ttgtagagcc tgaggagtgt ccaatcacac catttgctta 
6720 

aaacatcatg cacacttgga aaagtggact gagaccgaat gaagaagcta acagtggcca 
6780 

gatcagaaag ggtcttgtgt tacttcctag agatacttag attttatcct gtgggtgata 
6840 

ggagcagttg gagggactga agacaaggaa agaaacatgt ttcaagatct atgtttttca 
6900 

agacgctttt ctggtggctg agtagggaat tccctggata agtcctgccc agggtcaggc 
6960 

aaaacaagtt agggggttac tgaaataagg agtatgagaa atggtgtagg ttgtgctgac 
7020 

gttttgtaac acatctcatg atgatcttca tttccttcac taatttcctg tttcattaat 
7080 

tcccttccac gtgctcttct gaaatttgcc tcacattctc tgatttctct tttacctgtt 
7140 

ggtttcatca ccttttactt tttgctttcc tggaaacaca aatgattctg attgtgacat 
7200 



gtcagaatta tttgcaacat ttgcctttct gctgaaacca tgagttcact gaatacacaa 
7260 



tttagtaaag tgtaggatgc acatgtcgtt ttcgtggtca caaccagctc tgtagcattt 
7320 

tataactaca ctggcagtgt gctgggaggt gtagagagaa atatttatca catgtgtggc 
7380 

tgacacaacc tgccaagtta ttttaggagc ctccttggaa tcccagcaag aatgctaccg 
7440 

gcacaatttg taatcacagc atcctgctcc atgccttggc ttcatggcat agtcacttct 
7500 

gcaagtctct ttccagctgt ctgttcccat gtctataaag tatgagttaa atcatcctaa 
7560 

cactactcat cttacaaagt tttcttgctg atgttaagag agttgggaaa aaactgtata 
7620 

aactgtgaag tgccatggag atgttagtgg ttactttatc aagaaataga cactctagaa 
7680 

tggagtagaa agccaacagt tatgattgag tcctcctcct cttcttcttt ttattaattt
7740 

ataaagaaaa gaggtttaat tgactcacag ttccatatgg ctggggaggc ctcgggaaac 
7800 

tctcagtcat agcaggaggc aaaggggaag aaggcacctt cttcacaagg cgacaggaga 
7860

gagagagctc ctgttctttt ttgtcataaa gtctacagaa gtgcttatac ttcaggacaa 
7920 

gggcaggcag agagaaggaa ggacattgct tcaccccagc cctcactgac gagtttgcta 
7980 

ggggacctca ctttgtccca gagtagggca gaactctggc cactacccat tcagaaggcc 
8040 

tgggctgcac tgctagttcc tcactaactc tgtgtggcct tgggcaagct tgggcctgtg 
8100 

ttaacagatt atgaccctgg gctctcaagc tagaggatct aaatttgaat cctagctctg 
8160 

ctaaagcaat tagtgatgta aactttaata ggtcagttaa ccttcctato gcttagtttg 
8220 

ctcatctgta aaatagggat cataacagta tcaataccac atgattgttg gacagattga 
8280 

atcagttaat gcaggggaag tacttagcat gacacgtatt cactatcatt tcctggagta 
8340 
agagctgtgt gtgagtgggt gtgagcatgt gtgaaacctt ttctctgcaa tctcagttaa
8400 

gaaaccaatc cagaatttaa agttcagggc ctaaatgggt ggttatcttc tcccagttcc 
8460 

atcctatccc acctttgctc ttcctcccgc ccacaggagc tgttggtcct tgattgggct 
8520 

ggaagacctg gtggacccta agtgatctat aagaggagaa tagagaacag ggaatgtctt 
8580 

caaaaatcta gagggacaca gaggctgaga ggcaggcagt cctgcagggt cttctgattg 
8640 

ggacaaggag aaccttggtc ttcacaggcc aattctggtc agtttccccc atggacagat 
8700

gaggaaacag gcccaggaat atccaaggtc tcacacttcc catctgtcaa gtcttgttga 
8760 

ttctgttgta ttcatgtctc tcaaagggag atagagttta gggaagaaag aaggatcaac 
8820 

tgtgtctgat accactggga gcttaagtaa agggttcttt tacttcatag catttatccc 
8880 

aatttgtaat tcagtattat ttgtgtggct gtttggtgtc tctttctcct atatgagtgc 
8940 

tagcttcata agggcaagga ttttgattct ttaatattta gtgcttgcca catgccctga 
9000 

acacagcagg catacaggct aaccaacata cagtggcatg aaagtcatga aagtgagaca 
9060 

cctacctcct ccagtgccaa gagagcataa ccatgcacct gtcactctcc tcaacaccac 
9120 

ccccaagcat gaggcccaaa agcattagct aatcccctcc tccagccact aaaacttaaa 
9180 

ggccaggtgt ggtggctccc atctgaaatc ccagaacttc aggaaacaac agcagaagga 
9240 

tcacttgagg ccaggagttt gagatcagcc tgggcaacat agctaggtcc catctgtact 
9300 

aaaaattagc tgggcgttgt tgcatgcctg tagtcccagc tactaaggag gctgaggtgg 
9360

gaggatcact tgagcccagg aggtggaaac aacagtaagc tataatcaca gcactgaact
9420

ctagcctggg caacagagtg acaccctgcc tcaaaacaat tttaaaaata aataagagca
9480

aaacttagat accacgtggt caccccaaca tgcaaaatca agttttcccc tactgagaag
9540

aatggggact tgacagctga gttacagaga gataatcttc ttcttctttt tttttttttg
9600

gtttacatcc tcaagatcat gacttgtgaa atttgaatcg aatacacatg taattccaga
9660

gcaatgttgc ctccgcatac catcagcaat tcacttggct actggaagtc aggat
9715



<210> 3

<211> 632

<212> DNA

<213> Homo sapiens

<400> 3

tctagagaga tggttcattc ctttcatttg attatcaaag aaactcatgt cccaattaaa
60

ggtcataaag cccagtttgt aaactgagat gatctcagct gaatgaactt gctgaccctc
120

tgctttcctc cagcctctcg gtgcccttga aatcatgtcg gttcaagcag cctcatgagg
180

cattacaaag tttaattatt tcagtgatta ttaaaccttg tcctgtgttg accccaggtg
240

aatcacaagc tgaacttctg acaagaacaa gctatcatat tcttttcaat tacagaaaaa
300

agtaagttaa ttgataggat tttttttgtt taaaaaaaat gttactagtt tttgaaaagg
360

taatatgttg cacatggtaa acactaagaa ggtataagag cataatgctt ttatactact
420

aagaataatg ttttctctaa gttttttttg gtagatgctt tcatcagatt aagaaaattc
480

cctgctatta gttgttgaag gtttttatat cataaatgaa agttgaatat tattatcata
540

tattattaat atattgttat tgaactatca aagccttttc ctaaaaccat tgagatgatc
600

ttataaccat tctcctttaa cctgttgacg ag
632



<210> 4

<211> 11186

<212> DNA

<213> Homo sapiens

<400> 4

ggatccagtt tcagctttct acatatggct agccagtttt cccagcacca tttattaaat
60

agggaatcct ttccccattg cttgtttttg tcaggtttgt caaagatcag atggttgtag
120

atgtgtggtg tttgttctga ggcctctgtt ctgttccatt ggtccatatc cctgttttgg
180

tactagtacc atgctctttt ggttactgta gccttgtagt atagtttgaa gtcaggtagc
240

gtgattcctc cagctttgct ctttttgctt aggattgtct tgggaatgtg ggctcttttt
300

tggttccata tgaaatttaa agtagttttt tttccaattc tatgaagaaa gtcattggta
360

acttgatggg gatggcattg aatctataaa ttaccttggg aagtatggcc attttcacga
420

tattgattct tcctatccat gagcatggaa cattcttcca tttgtttgtg tcctctttga
480

ttttgttgag cagtggtttg tagttctcct tgaagaagtc cttcacctcc ctttaatttg
540

gattactaga tattttattc tcttagtaac aattgcaaat gggagttcac tcatgatttg
600

gctctctttc tgttattggt gtataggaat gcttgtgatt tttgcgcatt aattttgtat
660

cctgagactt tgctgaagtt gcttatcagc ttaaaaggat tttgggctga gacgatgggg
720

ttttctaaat atacaatcat ggcatctgca aacaggaaca atttgacttc ctcttttcct
780

aattgaatac cctttatttc tttttcttgc ctgattgccc tggccagaac ttccaatact
840

atgttgaata agagtcatga gtgagggcat cgttgtcttg tgctggtttc aaagtttttg
900

cccattcagt atgattttgg ctgtggtttt gccataaata gctcttatta ttttgagata
960

cgttccacca atacctactt tattgagagt ttttagcagg aagggctgtt gaattttgtc
1020

gaaggccttt tctacatcta ttgagacaat tatgtggttt tttaatcgtt gattctgttt
1080

atgtgatgga ttacatttat taatttgcat atgttgaacc agccttgcat cccagggatg
1140

aagcccactt gattgtagtg gataagcttt ttgatgtgct gctggattca gtttgccagt
1200

attttattga ggattttggc atcaatgttc atcagggata ttggtctaaa attctctttt
1260

tttgttgtgt ctctgccagg ctttggtatc aggatgatgc aggcctcaga aactgagtta
1320

gggaggattc cctcattttc tattgattgg aatagtttca gaaagaatgg taccagctac
1380

tctttgtacc tctggtagaa ttcagctgtg aatccatctg gtcctggact ttttggttgg
1440

taggctatta attattgcct caattttagg gcctgttatt ggtctattca gacattcaac
1500

ttcttcccgg tttggtcttg ggagggttta tgtgtccagg aatttatcca tttcttctag
1560

attttctagt ttatttgtgt agaggtgttt atagtattgt ctgatggtag tttgtatttc
1620

tgtgagatcg gtggtgatat cccctttatc attttttatt gcatctattt aattcttctc
1680

tcttttcttc tttattattc tggctggcgg tctgtcaatt tttttgatct tttcaaaaaa
1740

ccagctcctg ggtttcactg attatttgaa gggttttttg tgtctctatt tctttcagtt
1800

ctcctgtgat cttagttatt tcttgccttc tgctagcttt tgaatgtgtt tgctcttcct
1860

tctctagttc tttgaattgt gatgttacag tgttgatttt agatctttcc tgctttctct 
1920 

tgtggtcatt tagtgctata aatttccctc tacacattgg tttacatgtg tctcagagat 
1980

tctggtatgt tgtgtctttg ttctcattca tttcaagaac atctttactt ctgccttcat 
2040 

tttgttattt gcccagtagt cattcaggag caggttgttc agtcttcatg tagttgtgtg 
2100 

gttttgagtg agtttcttaa tcctgagttc taatttgatt gcactgttgt ctgagagaca 
2160 

gtttgttgtg atttgcattc ttttacattt actgagcatg ctttatgtcc cattatgtgg
2220 

tcaattttag aataagtgtg atgtgatgct gagaagaatg tatattctgt tgatttgggg 
2280 

tgtggagttc tgtagatgtc tattcagtcc actgggtgca gagctgagtg gacatgaaca 
2340

ttttatcaaa gaagaaacac agctatcaaa aatccagaaa tattgaacct tgttaataat 
2400 

aaagtggctg gcctctggtt cattcctgta atctcagtcc tttgaaaggc tgagaaagga 
2460 

ggatcacttg aggccacaag ttcaagacca tcctagacaa gtcagttcaa gaccagactt
2520

catgtctaca aaacatcaaa aaattagcca ggcatggtga tgcatgcctg tcatcccagc 
2580 

tactcaggag gctgaggcag gaggattgct tgagcctggg agattgaagt ggcagtgagc 
2640 

catgattgtg ccattgcact ccagcctggg caatgcatca agactctgtc taaacaataa 
2700 

taataataat agtaatagta ataataataa taataaagaa aacggttggg acgccattcc 
2760 

ttacttattc aatacacaaa gttaaaagca atttctactt tctctatttt tttattacta 
2820 

aaaaaagctg aaccattctc acagcctgaa atgcttctca ccttcccctc ttctatadaa 
2880

acacttctct gttgatgata atgcagacag tctctccttt aggaatactt cacaccaggt
2940

agttccagat ccccttatct ctgccttccc agagctcctg gtgtctcccc agttccctct
3000

gtgtggtgaa gtacccccac cttgggtctc agcatgactc gttctttgaa ggtcttgttc
3060

acattttccc ttatggttct gttcccctgt gttgtgtcac agcactgggc agagtggaca
3120

acccattcac accgatagag agggccccat ggttctggag ataaccatgt aactgatcag
3180

aatagggcat tgagggctgg gtgtcaggca tgggctgcac ttgggtgggc aggccccctg
3240

gaaagtcaca ggatttggca agcttcctag taacatctct ccctggggtc ctcttggaac
3300

ttcatgcccg atgctggatg ctggtttatt ctcgagagat ggttcattcc aataatcaat
3360

gaaactcatg tcccaactaa agttcataaa ctccagtttg taaactgaga taatctcagc
3420

tgaatgaact tgctgaccct ctgctttccc ccagcctctc agtgcccttg aaatcatgtc
3480

agttcaagca gccccatgag gcattacaat gtttagttat ttcagtgttt attaaacctt
3540

gccctatgct gaccccaggt gaatcacaag ctggacttct gacaaggaca agctatgata
3600

ttcttttcaa ttacagaaaa agtaagttaa ctgataggat tttttaaaga tgttactagt
3660

tttggaaagg taatttgtgc acatggtaaa caagaaggta taagaggata atgcttttat
3720

actgctgaga ataatgtttt ctctaatttt ttttggtaga tgctttcatc agattaataa
3780

aattcactgc tgttagttgt tgaaggtttt ttatatcatg aatgggagtt gaatattatc
3840

atgtattatt aatatattat tattgaacta gcaaaggctc ttcctaaaac aattgagatg
3900

atcttataat cgttctcctt taatctgttg atgagatcat tggtatttat actttttctc
3960

tgttaactat tcttgagtct caggtttaaa ttcaacttgg tcatggtgta tcatctttga
4020

acactcctgt ctctggcttg ctactattgt gttcagcatt tttgcactga tgccgatgaa
4080

tgagactggc atgtcatctt cctttgcggt cctgattttt ttcagatttg gatcatgtgg
4140

ccctcattga atgagttggg tgtgatgcct tctttttcat gtatctggat tgatgggaca
4200

ctttggagtc tctccagatg gccctcaatg gtccctgcct cctcattgtt aggctcctag
4260

gcaacccttt ctcatttctg gtaggcccag gaacctgtgg gttttatgtt tgtttgtttg
4320

tttgtttgtt tgttttttga gttggagtcc tgctttgtct cccaggctgg ggttggagtg
4380

caatggcctg atctcggccc actgcaacct ccacctcctg ggttcaagtg attctcctgc
4440

ctcagccttc tgtgtagctg ggattacagg catccaccac cactcctggc taatttttgt
4500

atttttagta gagacggggt tttacaatat aggccattgt gatctcttgg acaggctagt
4560

ctcaaattcc tgacctcatg atctgcctgc ctcagcctcc caaagtgctg agattacagt
4620

tttgtgcctc cacacacagt gaatctgtgg tttttaaaag ctcctcatgc atgtgaattc
4680

tgtgagcatc ccgggatgac agccactgtg tgtccagctg ttaaaactgt gagaaagcac
4740

cagcgggacc ctctccagca tttgcttgct gtggtcatga aagaggcttg ttggggagat
4800

gatgccctgg ttgactcctg aaggatggtt aggaatgcac cagatggaag ctgggttgga
4860

cccagtctat gctaaagaac agcttgtgtg gacacaagga gacacgaaca catcattttt
4920

gcagagcctg gggagtagcc aatcgcacca tttgcttaaa acaccgtgta cagttggaga
4980

agtggactga gacaggctga agaagctaac agtggccaga tgagaaaggg tcttgtgtta
5040

cttcctagat atacttagat tttatcctgt gagtgatagg aacagttgca gggactgaag
5100

ccaaggaagc atgctttaag attccatgtt ttttgagatg ctgtctggtg gctgagtagg
5160

gaattccctg gataagtact gcccagggta ggcaaaagaa gctaggaggt tactgaaata
5220

aggagtatga gaaatggtgt aggttttgct gatgttttgt aacacatctc atgacaatct
5280

tcatttcctt caccaatttc ctgtttcatt aattcccttc cacgtgctct tctgaaattt
5340

gcctcatatt ctttgatttc tcttttacat gttggtttca tcacctttta ctttttgctt
5400

tcctggaaac acaaatgatt ctgattgtga catgtcagaa ttatttgcaa cattcccctt
5460

tctgctgaaa catgagctca ctgaatacac aatttagtaa agtgtaggat gcacatgttg
5520

ttttcatggt cataaccagc tctgtagcat tttataacta cactggcagt gtgctgggag
5580

gtgtagagag aaatatttat ctcatgtgtg gctgacacaa cctgccaagt tgttttagga
5640

gccttcttgg aatcccagca agaacaccac tgatgcaatt tgaaatcaca atgtcctgct
5700

ccatgccctg gcttcatggc ttagtcacgt ctgaagtcta tttctaacta tctgtttcca
5760

catctataaa gtatgagtta aatcatccta atactactca tcttacaaag ttttcttgct
5820

gatattagga gagttgggaa agaactgtat aaattatgaa gtgccatgga gatgttggtg
5880

gttactttat caagaaatag acactccaga atagagtaga aagaaaacag ttatgattaa
5940

gtcctcctcc tcttcttttt ttttaattta caaagaaagg tttaattgag tcacagttcc
6000

atatggttgg ggaggctcag
6060 

accttcttca caaggtggca 
6120

cagaagtgca tatacttcag 
6180 

ccagtcctca ctgacaagtt 
6240 

ctctggccac tacccattca 
6300 

gtgtccttgg gcaaggctgg 
6360

agaatctaca tatgaatcct 
6420 

cagttaacct tcctgtggct 
6480 

ataccatggg ttgttagaca 
6540 

cgtattcact atcatttcct 
6600 

aaccctttca ctgcaatctc 
6660 

atgggtggtt atcttctccc
6720 

ggagctgttg gtctttgatt 
6780

aatgagaata gaggacaggg
6840 

gcaggcagtc ctgcaggggt 
6900 

aattctggtc aatttccccc 
6960 

tcacacttcc catctgtcaa 
7020 



aaaacttgca atcatggcag 

ggagagagag agctcctctt 

ggcaagggca ggcagggaga 

tgctttggga cttcattttg 

gaaggcctga gctgcattgc 

gcttatgtca aaagattatg 

ggctctgcta gagcaattag 

tagtttgctc atctgtaaaa 

gattgaatca gttaatgcag 

tgagtaaaag ctgagtgtga 

agttaagaaa cccatccata 

agttgcatcc tatcccacct 

gggctggaag acctggtgga 

aatgtcttca aaactcctag 

cttctgattg ggacaaggag 

atggacagat gaggaaacag 

gtcttgttga ttctgttgta 
ttggcaaagt ggaagaaggc
cttttttgtt gtaaagtcta 
agaaaggaca ttgcttcacc 
tcccagcata tgggacagag 
tagttcccca ctaactctgt 
accctgggct ctccagctac 
tgacgtaacc ttggatgggt 
tagggatcat aacaacatca 
ggtaaatact tagcatgaca 
gtgggtgtga gaatgtgtga 
atttaaagtt cagggcctaa 
ttgctcttct cctgcccgta 
ccctaagtga tctataagag 
agggacacag aggctgagag 
gaccttggtc ttcataggcc 
atccaggaat atccaaggtc 
ttcatgtctt tcaaagagag 
agagagttta aggaaagaaa
7080 



aagggttctt ttacttcata
7140 

tgtttagtat ctctttgtcc 
7200 

tttaatattt agtgcttgcc 
7260 

acagtggcat cgaagtgaga 
7320 

ctgtcactct cctcaacacc 
7380 

ctccagccac caaaacttaa 
7440 

taggagacag cagcaggagg 
7500 

tagctagaca ccatctgtac
7560 

ctactaagga ggctgaggta 
7620 

ctataaccac agcactgaac 
7680 

tttcaaaaat aaataaataa 
7740 

aatcaagttt tcccctactg
7800 

tctgcctttt tttttttttt
7860 

atctaataca caaatcattc 
7920 

ggccactgga agtcagaaca 
7980 

aaaggatgaa aatatcggag
8040 

tccactcatg ctgatatctg



gaaggatcaa ctgtgtctga 

gcatttttcc caatttgtaa 

tattagagag atagcttcat 

acatgccctg aacacagcag 

cacctacctc ctccagtgcc 

acccctaagc atgaggcccg 

aggccaggtg tggtggctcc

atcacttgag gccaggaatt

taaaaattag ctgggcatgg 

ggaggatcac ttgaacccag 

tccagcctga gcaacagagt 

aaacaaaact tagataccac 

agaagaatgg ggacttgaga 

tttggtttac atcctcaaga

cacagcaatg ttgcttctgc 

agcttcccag aagagaggta 

tgaaggtgtt ccttgcatca 

agcctaatgc ttctctgaat
tatcactggg agcttaagta
ttcagtatta tttttgtcac 
caggacaagg attttgattc 
gcatacagac taaccaacat 
tagagtacat gtccatggac 
aaagcattgc taatcccctc 
tatctgaaat ctcagaactt 
tgagacgagc ctgggcaaca 
tggtatacct gtagtaccag 
gaggtggaag ctacagtgag 
gagaccctgc ctcaaaacaa 
gtggtcaccc caacatgcaa
gctgagttac agagagataa 
tcatgacctg tgaaatttga 
ctaccacgag taattcactt 
ccacttggac taccaatata 
ctgagtccct ggacagcctg 
gttgagattt aactttgatc 
8100

caatgaaacc agaccaagaa 
8160 

ctcacaattt tatgattatt 
8220 

tctccatgtg cttagggtac 
8280 

actgacagca cagctttaaa 
8340 

aggcaagtgg tgcttgggtg 
8400 

agctctcagt aggcaagtcc 
8460 

cagaacatgc cttcaattat 
8520 

gagtgctggc tgcatttgaa 
8580 

caaccagtca caccacctcc 
8640 

aatcatctta gccaagtaag 
8700 

agttttgttt ctaaaaaaat 
8760 

atagttatca gatgacactg 
8820 

tgtggcaaca tagagaggga 
8880 

ctttctttct ccctgctgag 
8940 

ggagtaaccc ttgagacagg
9000

ttactatttg ccatagtgat
9060 

cctaatatct cccttcctgt 
9120 
agaagaaacg tctttcattg

tttccttagc tgtcctataa 

aaagttgacc aagaccaaga 

aacatgatga atgctttaat 

ttcttccaat gcatagtatc 

ctacatgtta gaagatgtta 

tttcctttgc agaacaacac 

ttccaagcaa cgattagtct 

tgaagtttgt tgggcaggta 

tgttttagag gaaagaagaa 

aagaaaggct ctttgtttag 

gaatctttac ttgccagaat 

gatcctccag caatgccatt 

atggggtcct ttgttctaga 

aacacgaatc tcaaccaatt 

gcttagcaag gaatggcagg 

taacttttta taatatttta 
ttgataagga catgattttt
ttatctgctt atttgtctct 
ataatgtctg ggagcacaat 
acaggaaatg agtaggggag
ttccttgaca cagtcagtgc 
ctttctgtgg aattaggtgg 
caatttcatt agttaggaca 
atcactgttg gtatagattc 
aatcttcatc ttagaataaa 
aacataatcg tttccataag 
gtgagctaat gaagttgttg 
gtgttctgtg cacctctcgg 
gatatggtca gaaactgcat 
aaacccaggg ggtgccactg 
tctggttgca gccttgagtc 
tgcaccagag cagcagagga 
ttgtgatcag tatcagttgg
gaagctactt gcagtcactg agcctcagtt tctacatctg taaactgggg atagtagcat
9180 

ggccctattt aatgtgctca gcgaagccac tgaaaggaga cagaaatgta ccagaattcc 
9240 

ctggactttt atcctacttc tcctggggat tgtcacccac ctacccgtgt ctgtcctttg 
9300 

ttgctttgac gctgtcactt cttttcttag gtacctctct gtagggctcc attattccag 
9360 

ggattccaga gttacagcac atgcatacct ccatccaagc atgtttattt gtctcctgct 
9420 

tcactaggct gtccccaagg aacatgtggc tcccggcaca tacctggcac aacactgcac 
9480

atgacattca cccacttggc cttgaatctg acaaggaatc tggcatgatg ttcacctgct 
9540 

gaggccaggt gccgagcagc cctggaggct taggggccag agggatggga aaaggtgtct 
9600 

ttctggggtg agtatcagtt tctgcaggag tgctgaacct gagaaagaat aaagagagaa 
9660 

ggaagtgaac aagcacagct taaacatcat ctgtttctac tgagttttaa caactctgag 
9720 

attttgtttg tcatggaatc catttctcag gccaagcaga cacagaactt gggtgtgagt 
9780 

gatgataatg agctgatata attttcacac cctcatcact gagatctctc ccatcaggaa 
9840 

tgggtcacag ggctcacagg tggcagcaac tgttattaca ggcctcatct ctaccagctc 
9900

ctggcacctg ctctcctctc attagaaaat cctccacttg tcaaaaagga agccatttgc 
9960 

tttgaattcc aattccaccc tcaagaggct gggaccacct cattggagtc cttgatgctg 
10020 

tgtgacctgc agtgaccact gccccattgt tgctggctga ggtggtttgg gtcaacctgg 
10080 

ccatctgggc agctgttctc ttctcttctt tctcccctac tgtttccaga catgcagtat 
10140
ttccagagag aaggggccac
10200

gacttttgaa agctacagag 
10260 

tccatctcat ccatgccatg 
10320 

gacaaatctg aaagtgcaga 
10380 

atgacctaag aagatggagt 
10440 

gcaatctcag ccaagtcaac 
10500 

tgatagggac cagagccatg 
10560 

tctgctaagg gaaacaggcc 
10620 

gaactgcatg aaggggatgg 
10680 

ggttggtgag gaaagactct 
10740

ggagtcttct agggggcaca 
10800 

tgttgggatg agtttcaagt 
10860

aatagatttt atgccaatgg 
10920 

tccccagtaa cattgattga 
10980 

tcagtgagtg gtgtgtgtgt 
11040 

ctgcaggcag agcacggggg 
11100 

agcatataaa caatccaaca 
11160 

cacacacagc ccagcaaaca 



tctttggcaa agaacctgtc 

gaagaagcac aaattgatgc 

tctctttttt aggggtcctc 

gacagcagct gaggcacagc 

ggtcaccaga aagtcagagg 

tccaccagcc tttctggtcc 

agagtgagta agaccagact 

tggaaacaca caatggtggt 

aaagtgccca ggggaggaaa 

aagagaaggc tctgtctggc 

ggcacactcc aggcataggt 

attctggaat gaggacagcc 

ctccacttga gtttgtgata 

gttgtgtatg attctacata 

gattatttgc caactgccga 

ccctgctact ggctgcagct 

gcctcactga atcactgctg 

gcagca 
taactttcta tctacggcag
tattccacta agccatcagc 
ttgccaacag aatcacagag 
caagagctct ggctgtatta 
aagtgacaca caggggccca 
ccactgtgtg tacagcaccc 
atgcccttga ggagctcacc 
aaagaggaaa gaagacaata 
tggttacttc tgtgtgaggg 
tgggtatgaa aggatgtgta 
aaagatctgt aggcatggct 
atagagacaa gaggagagtt 
agaacccaga acccttggac 
gaatattaac tcaatggagg 
ggtggagaag cctcttccga 
ccagccctgc ctccttctcc 
tgcagggcag gaaagctcca 
11186



<210> 5 

<211> 10200 

<212> DNA 

<213> Homo sapiens 

<400> 5 

gccaattaga agaaacacaa ctacaaggtc agggcatatt attcaaacag tagagacaat 
60 

acagtcaaat atttggcaga attacaaaat atctcattgg aaaagacacg caagggaaat 
120 

caacaaaaag atatgaatca gaattcatct gtgtctcaag aaaaggtcat gcgataaatt 
180 

aagttctgct agtgtttcta cactaccgtt agcctcatta ccttattttt taagtgttaa
240 

tatagtttta ggtattttac atacattttt attattaatt acaaccaaag tgcaacttgt 
300 

aatagcaatt ccttcacatt ttttttttca aatcttgcac cttaaaatcc acctcgggcc 
360 

tcagttggcc agctttggta tctgatactt ggactacaga taccactaag gcaagtagat 
420 

aaaatgtact ctaggaccta cagcccttct gctagatcct gaagaatgat cattaaaaca 
480 

agctggtcta gctggtcaag agcaaaaata aaatcaagat gacagaaaat tgatgcaaaa 
540 

gtgaagtaaa atagctagag aatatgattg cgcctgtccc cttagcatgg attcccatgc 
600 

tagccaatct aaaatcctca ctgttagaat cctcctgtca atatgataga atgaacagca 
660 

agctcagtgt cagaaaacct gtgttgttaa cttggccctc tttctagctg aatgtgtgtt 
720 

tttggtcaag ttctttggca tttcagagac tcagagtagt gaaggaagtg gataagatga 
780 

cctctacatt ctcttgcaag ctcaaacatc tatgaatcca gagagaaaaa ctagagcatg 
840 

aaattaaggt tattttaaag aaataacctt aaaattatta gtattcgagg atctccaata 
900

tattcatggc accactcaaa actttccttc tgctctatcc cgtcttggct caaagttatc 
960 

tccttaatga ggtctgccct gactatccta cttaaaattg taaactttgc ccacctggta 
1020 

cttccactct ctttcccctg ctctgttttt caccgtaata ctttactctt tttaacatac 
1080 

aaaatcactt atttactgtg ttgttatcta tctgcctact cttaccatca aatataagtt 
1140 

ctacctaggc agggattttt gtatgttttg ctcatggata tatacgaagc acttagagta 
1200 

atatgtgaca tatacagggt acttgattaa tactgttgag tgaatgaatg agtttccaat 
1260

acaaatttaa aataaaatat ttcctaactt aaaattgtaa agtcagatct aaccaactgt
1320 

tcattggtct gctagcagtg tttcttgtat atggaaatat attttaaata gatatgtcct 
1380 

gtgaaataat actaagtgtt ctaaagaaat aagtgagtga acgttacctc attgaactaa 
1440 

cttgaccttg ctcctgggag agagttcatt tgagattaaa caagttcaaa gtctatgaat 
1500 

cataaaacga taaaaaaaac taaaagggaa atggtgtttt tataagctct gcaattcaaa 
1560 

agccatttcg ggtaatattg ttatttttat gtcaggaatt cctcagtgct gatatcttag 
1620 

ggcaaagggt ttggttataa attaagagaa tgaggaaata ggtacatagt aggattgttc 
1680 

caaccaaata tgtgttgaat gtcaaaggaa tttccctgag gaataatctt cagaataatt 
1740 

tgctaagcac aggagaaaat ttggcttatt actttatagc cagatttcat ttttaattga
1800

aacttctttc aagcaaatca cttactagtc tattaacaat aacaacataa acacaagtaa 
1860

acattcggaa tatagacatc caggtactaa gctgattgct ttacactcac tgtcttattt 
1920 
tacaagtaag gagttttagt tgcagcaaaa gaaataaatt ttccaatgtc aaatgaccag
1980 

aacttaaacc caatctgttt ggtgctaaag ccaatgttct ttactgcaat gttgggttat 
2040 

cttgtttcta aaacttaaat ttatcagtaa aaggcaaaat ttgctattat tgaggacatt 
2100 

aaaatcatat ttttgtagac tctgaggaca aatccaacaa aaaagttcca actatttctt 
2160 

ggcaggcatc attgaaattg gtatatagct tccttgggta ttgactttga aaaggaagtt 
2220 

ggtcacttta gatatataag ttcagtctgt ttgtaaaaac aaaatgaaaa caaaacagtt 
2280

gccttatatg ctaaaattat cctaatcgtt ttcaccttta acaacatata cacacagaac 
2340 

ttgaggaact ttacacggct catcttcata ttgtcagcat ctagcaaagt acttgccaca 
2400 

tagtgatcaa taaaagtttt agccagcctg ggcaacatag tgacagccta tctctacaaa 
2460 

aaaaaattag ccaggcaagg tggcgcacac ctttggtccc agctacttga gaggaggatg 
2520 

tggggagatc ctttgaatgc aggaggttga ggctgcagtg agctgtgatt gcgccactgt 
2580 

gcttcaacct tggtgacaga gcaagaccct gtctcacaca cacacaaatt aagtaaagaa
2640 

aaaaaaaaga atcaaagaaa aaaataattc cccagcttaa gtccatcttt atttgtttgg 
2700 

ataagctata aagtgtcaaa taatgctgtt aatggacatt tctctagctc tcccaaagga 
2760 

ggaattgagc acatagtatg tgctgtattt tatatacaga ataaaaatag agacaagatt 
2820 

tctaccctca cagaacttaa attcttcagg agaatgacac tgaagtcctt aattggactt 
2880 

ctctcttctg tattatcttc ctcaagtgga ggtatatggt gcttagttat gaaaaatacc 
2940 
tccagggctt tgatcttctc
3000 

aaaaccatgt atccaactta 
3060 

agcctcccat gtattttctt 
3120 

tcttcctctc tcacttttct 
3180 

ttaataaatg aaaaatactg 
3240 

ccttcattca atgtgagtgt 
3300 

ataaccagga ttctgaaaat 
3360 

ataaaataat acacttttaa 
3420 

tcagacctaa ctttcatcac 
3480 

taatattaca ccaattaatt 
3540 

tgccttttgt ccttgcatat 
3600 

cacgttttat attactactt 
3660 

tgtggttaag tattcaggtc 
3720 

atacctttgg ctgatatttt 
3780 

aaattgaatt actgagtcaa 
3840 

tgaataattt tcaactccta 
3900 

tttcaaaata tctttgctgt 
3960 

tatgttttct tctgtgtacc 



aataactctt tgaggctgat 

tatagacagt tgatgaccaa 

attacttaag gagaatctct 

tagaaacatg ggtaagattt 

ggaattctca atgtttcttg 

cccttaataa ggagcaaact 

gcattctcat ccccatctcc 

tataggaaat ttctcaaata 

taaaagataa tcactcttga 

tgcttgatat agtgaatatt 

tagcataggt attttctgag 

ttttaaagag gatgtataat 

actgctcatt tttcactgtt 

tttctgtact tggaattatt 

acagacttaa gttttcctta 

cttgtttatt taactcttgt 

tgtaatttgc atttctttgt 

agtatactac atactttttt 
atgaaaacag taattagaaa
agctagaatc cagttatttc 
atctctacct ctttctctct 
tcagaaatat gagaaactta 
ttttagccag ttaattttgg 
ccactgagag atagatacta 
aaacttttat aaaaaatatt 
cagaaaaaat taaagtaaac 
cactttgata tctttatctc 
atgctattat aattttcccc 
gttatcacaa actctgtaag 
aattaattca tccatatata 
ataaaataaa gcagcaatga 
tctttaagat agatttccta 
tgtatgtttc cttattcatt 
gagcatgtga tagtctcatt 
agttagcatg aatatttcag 
atatgaattg cctatttgca 
4020

tcttttgctc ggttctatta 
4080 

atatattaag aatattaaac 
4140 

tggttttaaa tgtgaattta 
4200 

ttaaaagtaa taaaattatt 
4260 

ttccagtcat cacttttact 
4320 

ccatctttat tagga£attt 
4380 

accttatttt cttcttacaa 
4440 

cacagaaatc aaggtttttg 
4500 

ttttgagcag tggagtattt 
4560 

ccaagagttg ccacttcatt 
4620 

cttaattttg tcattcattt 
4680 

agggggcagt gacttactca 
4740 

ttgatagagg aggctgggcg 
4800 

aggtgggtgg atcacttgag 
4860 

ccatttctac taaaattaca 
4920 

ctactcagga ggctgaggca 
4980 

cctggattat taacactaca 
5040 
gacctttgaa ttttttctta

tattgtgata tttgcaataa 

ttcaattttc ttcataattt 

gccatattat tatttatttt 

ctcatgttct taatttttat 

gggaaataat acaatttata 

aaaccctagg aatatgctgt 

agcagtggag tatttcttca 

cttcaaagcc acacagctag 

ttcttctttc cctttatctt 

aaaaacattt cttgtgctat 

gagagatgat tctctaatgg 

tggtggctca tgcctataat 

gccagaagtt caagatcagc 

aaaattagcc tggtgtgctg 

tgagaattgc tggaacccag 

ctccagcctg ggagacagag 
tccatttata taagctcttt
atagctatat ggtttgttgt 
tgttgttttt atagatttct 
caatgtccat tactagccct 
tcatatcttg gctccattga 
ctaaacacac atcaaatctt 
ttttgtcttt atttgaatga 
aatgacacag aaatcaagtc 
taagtcatga agctggaatt 
actcagttgt cttctctcct 
tatggtagat ttattttaat 
agttttaaag atcttagaag 
cccagcactt tgggaggctg 
ctagtcaatg tggtgaaacc 
gtgcactccc ataatcccag 
gaggcggagg ttgcagtgcg
taatactcca tctcaataaa
aagaagttga taagggagat
5100 

cttaggcatt tagcctacta 
5160 

gcacctaata ttttctacaa 
5220 

tttgaatatt taaaaaattc 
5280 

atgccgatac tgaaaagtag 
5340 

ggggaagtgt tcaaagagga 
5400

gctaactcac taagcgactt 
5460 

tcacacaact catgtctgtt 
5520 

aaatgatcct attgaccttc
5580 

caatctgatg taatccaaca 
5640 

agtcatttaa tatcccctat 
5700 

agggtggaga gtgtgtggat 
5760 

gttattattg ttaacacaag 
5820 

tattctttct ctgcctcagg 
5880 

gcatgttgaa ccttcaatct 
5940 

gaaaataaat tattagagag 
6000 

gagaaaaaca tccaaaagaa 
6060 



agttcatggc aacggatctt 

gtgtaatttc cataaatctg 

acatttattg aaactttatt 

ataatcatat gaaacattaa 

gattgcggag tggtagaaaa 

gtataattca ggtttccatt 

aattaaaatt aaattaattt 

gctgtattgg ctaaatgatg 

ggaatctgga tttttttttc 

tgagttcaag cacagtcatt 

catcacatgt ccttcataca 

cccttattat tgtgttattg 

tgagtcatat gtcttgctct 

tcaattaact tcattgaggt 

ggactcagat gggctaaata 

atcagagaaa gatacataag 

ttaaaaacca taggagaagg 
tgaaggcacg ctaatgataa
cctctgatgt catactctca 
ttgtataagt ctctgtccag 
taataaatac aaaatgagag 
tatttctggc tgtagtagat 
tgccatcgac ttatcacatg 
atcatcatct gatcaccatt 
gcaagacaaa cgacctctga 
aatgcaggtg tccatagaag 
taatatcccc tatcaagtac 
taaaaatcat tacatgtgaa 
taacacaata acaatattgt 
ttggactgag tggaaacttg 
gagttgcatt ccttctttaa 
gaggagctag gaaaaataca 
atttacaata aaagaattat 
aaaaataggt gaacagcttt 
ttaatttcta taaatgtgtt gttaatactc ataataaagg actcagagct gggatatgag
6120 

aataataggt caaacgtata tggatacata gatgtgacta catacatgag ttgcaaagaa 
6180 

tgctaaggag ggcaaaaaga gattgagaag agggcattat tactaatata tagcaatgtt 
6240 

gaatgtttag ggtgtttcag gcactgtaca aatcttttaa atacacaaat cacttaatct 
6300 

tgccataaca ttagaagata ctatctactc tttaccaaaa aggtaactgt gggatagcag 
6360 

agttaagtaa cccgttcaaa cctatgcata ataatcagca gagatggtcc ttatctaagt 
6420 

ttttctgcct ttgaagtcca aatagtttaa tgcagccagg tactaaagaa gaaaactttt 
6480 

gtaaattagt ttagtttaat gatttacatg tggaaaagca cagagtgaaa agcacatatc 
6540 

atctgagaag cccagtgagt ttggctgaaa tggagtgaac atgtacatgt tgagggtgag 
6600 

gaagatgatt agagagaagt gatgtgttgt ggttcttaaa agctaagagg aaactgttag 
6660 

atatgatata gtctgtggca gggagccatt gcaggttttc caagatggac tcataaggag 
6720 

aaaacacttc gtgggact^g aaaaggcagt gaagtggtgt gtcctatgat aatgtactac 
6780 

agtgtaggat agtggttaag agtacagtta tgagagaggg actactggtt agcaccttac 
6840 

ctgctgtgtg actgggcaaa taatgcaaac ctcagtgtct tttattgtaa tatgggagta 
6900 

acaaaaatag taactacttc ataggattct tgtaaagatt aaatgactta atttctttga 
6960 

agtgcttggc agttcctgat aaatgaccag tagttaataa atgttagttg ttattattat 
7020 

cattatatat tattactccc atagatacat atagaacaga ctgcagcaga gaggcaaatc 
7080 

tttaatgttg tcagagtata gacaagttgg tgaaatggct acatgagagc ggaggacaag
aaggtgcaga ttgtggcagt
7200 

gagtgtcaac ttgcaagggg 
7260 

aacagccaag ttcagcttga 
7320 

aagaaaccaa ggactgttga 
7380 

atattcacac tttcatgtca 
7440 

aatcttattt ttaa^atctg 
7500 

atctttacct gatgctcaag 
7560 

tatttatatc ctgcactgtt 
7620 

caacaagatt acataaatga 
7680 

agagatcata ggcacaaata 
7740 

ttaccccaga tggaccacag 
7800 

cttgcatatg caattagaac 
7860 

atttgtgaaa aaaaaaaaaa 
7920 

tccctgcact ttgggaggcc 
7980 

tggctaacac agtgaaaccc 
8040 

gcgggagcct gtagtcccag 
8100 

gaggcggagc ttgcagtgag 
8160 
cacttcaaat ggaaatatca

accaggtagg tttcatcaga 

cagaagtggt ggtggcacaa 

aagtagcaag agctagtttg 

gtggagcaaa gaaatggaat 

tgttattctg atctttaact 

attgtagaaa tagtataatc 

tagggagggt ttaaggccat 

aaggtgagat aaagcaacaa 

agattaaaaa cgcatgtaat 

ggttgttgtt aagcctttaa 

atgtggaaaa aatagtggcc 

aaaaaaaaaa aagaggccga 

gaggcgggcg gatcacgagg 

agtctctacg aaaaatacaa 

ctacctggga ggctgaggca 

ccgagatcct gccactgcac 
aagactccgt ctcaaaaaag aaaaaagaaa gaaaaacaaa agaaaacttc attgtattgt
8220 

aaggccaaga acaaaatata tcaagataag gaaaatttgt agtcaagaat agaaaaaaat 
8280 

tatggctttg aagtatgagt tatttaaaga aagtggaaac atcctcagac tatgcagtaa 
8340 

aaaacaaagt gattttcttc ttctaaactt atgcaataaa ctgataggta atatgtgaaa 
8400 

gtcatagaat gtagactaga ggatacaaca aacctatttc ctctatgttc ataagaagta 
8460 

agaaaagctc tgatgtgagt tagcattgct ttacaatttt gaattgtgca gattgcacgt 
8520

acttttcctc agtttgaagt aaatagtgga caggaaaaaa tattaaatgt tggcagtaaa 
8580 

tatggaagga aattacaact aatgtaatat gctaaaacat gctatgttta ttttactaat 
8640 

ttgaattaaa atgtaagaat ttaaaatgcc ctggaaaaac acgggcattg atctgacgtc 
8700 

tgaagtttta aaatattaca cactttgaaa tagcatttgt accttgaaat acctgtctct 
8760 

atatattttt taaaacttcc tttttctttc attccattta tcatcaaata aaggatgaac 
8820 

agatgtaact cagaaactgt caagcatgct gaagaaagac cactgcagaa aaatttctcc 
8880 

tagccttttc aaaggtgtta ggaagcagaa aggtgataca gaattggaga ggtcggagtt 
8940 

tttgtattaa ctgtattaaa tgcgaatccc gagaaaattt cccttaacta cgtcctgtag 
9000 

ttatatggat atgaagactt atgtgaactt tgaaagacgt gtctacataa gttgaaatgt 
9060 

ccccaatgat tcagctgatg cgcgtttctc tacttgccct ttctagagag gtgcaacgga 
9120 

agccagaaca ttcctcctgg aaattcaacc tgtttcgcag tttctcgagg aatcagcatt
9180
cagtcaatcc gggccgggag cagtcatctg tggtgaggct gattggctgg ccaggaacag
9240 

cgccggggcg tgggctgagc acagccgctt cgctctcttt gccacaggaa gcctgagctc 
9300 

attcgagtag cggctcttcc aagctcaaag aagcagaggc cgctgttcgt ttcctttagg 
9360 

tctttccact aaagtcggag tatcttcttc caaaatttca cgtcttggtg gccgttccaa 
9420 

ggagcgcgag gtaggggcac gcaaagctgg gagctactat gggacagttc ccaagtgtca 
9480 

ggctttcaga tttcctgaac ttggtcttca cgggagaagg gcttcttgag gcgtggatag 
9540 

tgtgaagtcc tctggcaagt ccatggggac caagtggggt tagatctaga ctcaggagct 
9600 

cctggagcag cgcccaaacc gtagtggcac tggaccatgt tgcccggagc gcgcacagcc
9660 

cgcgcggtgc ggggacctgc tctctgagcc cgcgggcggt gggtgggagg aagcatcgtc 
9720 

cgcggcgact ggaaccggga gggagaatcg cactggcggc gggcaaagtc cagaacgcgc 
9780 

tgccagaccc ccaactctgc cttcgtggag atgctggaga ccccgcgcac aggaaagccc 
9840 

ctgcagtgcc catcgcggcc agagcagctg gggcatcaac ggcgggcgct ccctcttact
9900 

gctctctggc ttcgacgggg gactagaggt tagtctcacc tccagcgcgc ctgaggctca 
9960 

tgcatttggc taatgagctg cggtttctct tcaggtcgga atggatcttg aaggggaccg 
10020 

caatggagga gcaaagaaga agaacttttt taaactgaac aataaaaggt aactagcttg 
10080 

tttcattttc atagtttaca tagttgcgag atttgagtaa tttatttcta gcctccagct 
10140 

ctgaaataaa tgacatgttg ttgtttttaa ttatttttaa gaaacgcaag ctagcctttg 
10200 